Ministry of Justice 
Patent OfHce 



I : 




STATE OF ]SfLAEL 



4 



IS DEC 2000 




This is to certify that 



annexed hereto is a true 
copy of the documents as 
originally deposited with 
the patent application 
of which particulars are 
specified on the first page 
of the annex. 



PRIORITY 
DOCUMENT 

SUBMITTED OR TRANSMITTED IN 
COMPLIANCE WITH RULE 17.1(a) OR (b) 



■'3 miynV riKT 



This 



1 3 -11- 




Dvn 



Commissioner of Patents 




Certified 



for OtVicc r 


V umber 




28-12' ]S99 




Dale 




: nm3/o7pin 







1967 - :"3\yn ,a>v>3uon pin 

PATENTS LAW. 5727-1967 

t 9 

Applicanon For Patent 



'N.inic and address 01 applicam and nunse .fh . ^'v .v.^anno^y; , 



Riyht of Law 

of an invenrion the :itle of which is 



V7n nan nN5f>3N 

Owner, bv \ inue of 



Novel nucleic acid and amino acid sequences 



I Icrohv iipph t or a paie.ii lo he ■jrantcd lo „ic j ,, respect tlicreol 

r^^^t*** «««ti«. - I — — — 



Application of Di\ iMon 



No. 



Dated 



from application 



Ot7i» nvpa • 

Appl. ibr i^ucnt of Addition 



No. 



Dated 



to l*atcnt/App|. 



P.O.A.: General . 

filed in case P128131 



i?N-)>y>a Q^DntJo J)-i^t?»b )von 

Address for Service in Israel 

REINHOLD COHN AND PARTNERS 
Patent Anomevs 

P O.B. 4060. fe!-Aviv C. 122052 



p. , . .. Signature of Applicant 

For the Applicants. 

REINHOLD COHN AND PARTNERS 
By : — 



mis .orm. impressed with the Seal of thM^Stem 
which are set out above. 




Number/Mark 



Priority Claim 



(Hebrew) 
(Enizlish) 



Dale 



Convention Country 



i999__ ji3>y December xynna 97 

Year 



For Office Use 



This 



Delete whatever is inapplicable 



No\ cl nucleic acid and amino acid sequences 



Compugen Ltd. ' ' 



C. 122052 



- 1 - 



I 



FIELD OF THE INVENTION 

The presen, invention concerns novel nucleic acid sequences, vectors and 
host cells containing them, amtno acid sequences encoded bv said sequences and 
ant.bodies reactive with said amtno acid sequences, as well as phatmaceutical 
5 compostttons comprising any of the above. The present invention .Itrther conce-^s 
methods for screening for candidate activator or deactivators utilizing said amino 
acid sequences. 

BACKGROUND OF THE INVENTION 

Bone motphogenetic proteins (BMPs) are membetj of the TGS-B super 
,0 famtly of pepttdes which have conserved carboxy-teriminal re.tons containing 
se^■en cysteine repeats. Multiple BMPs or osteogenic proteins fOPs) have been 
described including BMP-2, -3 (or osteogenin), -4 -5 -6 -7 (or OPr) and -S (or 
0P2). BMPs are known for their role in embtyonic development and differentiation 
and have also mitogenic properties for skeletal cells inducing the differentiation of 
mesenchymal cells into osteogenic cells as well as enhancement of the 
differentiation of osteoblast. TTtey are also known to increase collagen svnthesis 
and inhibit collagenase 3 expression by osteoblast. 

Chordin is a secreted glycoprotein with a molecular mass of 120 kD and is 
synthesized by the Spemann organizer of the amphibian gastrula. Chordin mimics 
20 the action of the Spemann organizer which can induce the formation of neural 
tissue from eaodem, and induce dorsalization of the ventral mesodem, to form 
muscle. These two activities are opposed (i.e. antagonized) by BiMPs and chordin 
binds and blocks the action of BMP.2 and BMP^ by preventing receptor bindin« 
It has been posmlated that chordin may be expressed b,. cells of the osteoblasf'c 
25 lmeage_,o limitBMP actions in the o steoblast. TTtis would b. . c H..., «,„^i„„ ^ . 
a BMP bmdmg protein since excessive BMP-1 has been implicated in pathogenesis 
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of nbrodysplasia ossificans progressiva. Ii has been posriiiaied :hat BMPs can 
cause mduciion of noggin and chorcin niRNA and proiem ie\"eis :n skeleian cells 
b\' :rasncripiionai meciianisnas. and m rum t!:ese prev ent the elbeci of BMPs in 
osteoblast m a negati\'e-t}"pe feeback. The induction of these proteins by BMPs 
appears to be a mechanism to ninit the BMP efrect in bones (Gazzerro or c;:.. J. 
C::n. InvesL, 102( IZiiZlOo-IlU i 1998)). 



GLOSS.AJRY 

In the following description and claims use vvill be miade. at tim.es. w'wh a 
\ anet}- of tenns. and the meaning of such terms as the>- should oe construed in 
accordance with the in\'ention is as follows: 

''C/wrdin like homolog (CLH) nucleic acid sequence'' - the sequence shown in 
an\ one of SEQ ID NO: 1 to 4. sequences having at least 70° o identit}- to said 
sequence and fragments of the above sequences being 20 b.p. long. Those 
sequences are sequences coding for a novel homolog of the known Chordin 
protein, as well as for variants of the novel homolog produced by alternative 
splicing. 

The sequence snown in SEQ ID NO: 1 is a homolog to the known chordins 
within the VWTC domain, named after the von-Willebrand factor (VWF) type C 
repeat, w^hich is found 2-4 times in these multi-domain proteins. The VWF 
domain has a length of about 70 amino acids covering 10 well conserved cysteines. 
The presence of this region in complex-forming proteins leads to the assumption 
that the V\VTC domain might be involved in forming larger protein complexes. 
The homolog is a part of a longer sequence termed hereinafter ''full sequence '\ Tne 
full sequence has three naturally occurring splice variants which are also termed 
CLH. The first vanant (SEQ ID NO: 2) has 5 out of V^VTC domains of the iaiown 
chordin. The protein coded therefrom contains a predicted signal peptide. The 
second variant (SEQ ID NO: 3) and third variant (SEQ ID NO: 4) contain 3 out of 
4 VWTC domains of the known chordin. but is not predicted to contain the signal 
sequence. 



Hovvever. the term CLH does not necessarily signify' rhat CLH protein coded 
bv the above sequences (including the variant sequences) has the same or even 
similar physiological effects as known Chordins. merely that it shows sequence 
homology- with the known Chordin. 

'^I'arianf^ - a sequence produced by alternative splicing of chordin like homolog of 
SEQ ID NO: 1. These sequences are not merely truncated forms of the full 
sequence, or modifications of the SEQ ID NO: 1 but rather nararally occurring 
sequences resulting from various alternative splicings. 

''Chordin like homology product (CLH product) - also referred at times as the 
''CLH protein" or "'CLH polypeptide'' - is an amino acid coded by any one of 
SEQ ID NOS: 1 to 4\ The amino acid sequence may be a peptide, a protein, as well 
as peptides or proteins having chemically modified amino acids (see below) such as 
a glycopeptide or glycoprotein. An example of an CLH product is shown in any 
one of SEQ ID NOS: 5 to 8. The term also includes analogues of said sequences in 
which one or more amino acids has been added, deleted, substituted (see below) or 
chemically modified (see below) as well as fragments of this sequence having at 
least 10 amino acids. 

"Nucleic acid sequence'' - a sequence composed of DNA nucleotides, RNA 
nucleotides or a combination of both types and may includes natural nucleotides, 
chemicallv modified nucleotides and synthetic nucleotides. 

"Amino acid sequence'' - a sequence composed of any one of the 20 naturally 
appearing amino acids, amino acids which have been chemically modified (see 
below), or composed of synthetic amino acids. 

"Fragment of CLH product" - a polypeptide which has an amino acid sequence 
which is the same as part of but not all of^e amino acid sequence ofThe CEH 
product. 



''Fraginents of CLH nucleic acid sequence^' a continuous portion, preierably of 
about 10 nucleic acid sequences of the CLH nucleic ac:d sequence. 

''Conserrative substitution'' - refers to the substitution of an amino acid in one 
class b\' an ammo acid of the same class, where a class is deilned p\- comimon 
ph'v sicochemicai amiino acid side chain properties and high substitution 
frequencies in homologous proteins found in nature, as determined, for example, 
bv a standard Davhoff frequency exchange matrix or BLOSL'M matrix. [Six 
general classes of amino acid side chains have been categorized and include: 
10 Class I (Cys): Class II (Ser. Thr. Pro. Ala. Gly): Class III (Asn. Asp. Gin. Glui: 
Class W (His. .Arg. Lysj: Class V (He. Leu. Val. Met): and Class \\ (Phe. T\t- 
Trp), For example.- substitution of an Asp for another class III residue such as 
Asn. Gin. or GIu. is a conser\ ati\'e substitution. 

'*\on-conser\^ative substitution'' - refers to the substitution of an amino acid in 
one class with an amino acid from another class: for example, substitution of an 
Ala. a class II residue, with a class III residue such as Asp. Asn. Glu, or Gin. 

''Chemically modified" - when referring to the product of the invention, means a 
20 product ( protein;) where at least one of its amino acid resides is modified either by 
namral processes, such as processing or other post-translational modifications, or 
by chemical modiilcation techniques which are weii known in the art. Among 
the numerous known modifications t}^pical, but not exclusive examples include: 
acet\iation. acvlatiom amidation, .ADP-ribosylation. glycosylation, GPI anchor 
25 formation, covalent aUachment of a lipid or lipid derivative, methylation, 
myristlyation. pegylation, prenylation, phosphor\iation. ubiqutination, or any 
similar process. 

''Biologically active" - refers to the CLH product which have, regulatory or 
30 biochemical functions on die same target sites which naturally occurring CLH 
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influence, for example can bind to the same receptor as the chordin (or to another 
receptor), can control BIVEPs expression, and regulate the amount of effective or 
biologically available BMP. In panicular the CLH may have effects in osteoblasts 
as well as other cells of mesenchymal origin. 

Immunologically active'' defines the capabiliiy of a natural, recombinant or 
svnthetic CLH product, or an>' fragment thereof, to induce a specific immune 
response in appropriate animals or cells and to bind with specific antibodies. 
Thus, for example, a biologically active fragment of CLH product denotes a 
10 fragment which retains some or all of the immunological propenies of the CLH 
product e.g can bind specific anti-CLH product antibodies or which can elicit an 
immune response \yhich will generate such antibodies or cause' proliferation of 
specific immune cells which produce CLH. 

15 ''Optimal alignment'' - is defined as an alignment giving the highest percent 
identity score. Such alignment can be performed using a variety' of commercially 
available sequence analysis programs, such as the local alignment program 
L.\LIGN using a ktup of 1, default parameters and the default PAM. A preferred 
alignment is the one performed using the CLUSTAL-W program from 

20 Mac Vector (TM), operated with an open gap penalty of 10.0, an extended gap 
penalty of O.L and a BLOSUM similarity matrix. If a gap needs to be inserted 
into a first sequence to optimally align it with a second sequence, the percent 
identity is calculated using only the residues that are paired with a corresponding 
amino acid residue (i.e., the calculation does not consider residues in the second 

25 sequences that are in the "gap" of tlie first sequence). 

"Having at least X% identity" - with respect to two amino acid or nucleic acid 
sequence sequences, refers to the percentage of residues that are identical in the 
two sequences when the sequences are optimally aligned. Thus, 70% amino acid 



secuence idenim- means that of the amino acids in r^-o or more optimalh" 
a!i:^nec polypeptide sequences are identical. 

'Isolated nucleic acid molecule having an CLH nucleic acid sequence'' - is a 

: nucleic acid molecule that includes :he coding CLH nucleic acid secuence. Said 
isolated nucleic acid molecule ma}- include the CLH nucleic acid sequence as an 
independent inseni ma\" include the CLH nucleic acid sequence fused to an 
additional coding sequences, encoding together a fusion protein in which the 
CLH coding sequence is the dominant coding sequence (for example, the 

10 additional coding sequence may code for a signal peptide): the CLH nucleic acid 
secuence may be in combination with non-coding sequences, e.g.. introns o^ 
control elements, such as promoter and terminator elements or 5' and/or 3' 
untranslated regions, effective for expression of the coding sequence in a suitable 
host: or may be a vector in which the CLH protein coding sequence is a 

15 heterologous. 

''Expression vector'' - refers to vectors that have the ability to incorporate and 
express heterolosous DNA fragments in a foreign cell. Many prokaryotic and 
eukar>^otic expression vectors are known and'or commercially available. 
20 Selection of appropriate expression vectors is within the knowledge of those 
ha\'ing skill in the art. 

''Deletion" - is a change in either nucleotide or amino acid sequence in which 
one or more nucleotides or amino acid residues, respectively, are absent. 

25 

"Insertion" or "addition" - is that change in a nucleotide or amino acid 
sequence w^hich has resuhed in the addhion of one or more nucleotides or amino 
acid residues, respectively, as compared to the naturally occurring sequence. 



"Substitution'' - replacement of one or more nucleotides or amino acids by 
different nucleotides or amino acids, respectively. As regards amino acid 
sequences the substitution may be conserv ative or non- conser\'ati\ e. 

f "'Antibody-- - refers to IgG, IgM. IgD, IgA. and IgG antibod>'. The definition 
includes polyclonal antibodies or monoclonal antibodies. This tenn refers to 
whole antibodies or fragments of the antibodies comprising the antigen-binding 
domain of the anti-CLH product antibodies, e.g. antibodies without the Fc 
ponion. single chain antibodies, fragments consisting of essentially only the 

10 variable, antigen-binding domain of the antibody, etc. 

"Activator" - as used herein, refers to a molecule which mimics the effect of the 
natural CLH product or at times even increases or prolongs the duration of the 
biological activity- of said product, as compared to that induced by the natural 

15 product. The mechanism may be by binding to the same receptor of target 
moieties to which native CLH binds thus mimicking the activity of CLH: by 
prolonging the lifetime of the CLH. (for example by decrease of the rate of its 
degradation) by increasing the activity of the CLH on its target (modulation of 
expression and amount of BMPs). by increasing the affinity of CLH to moieties 

20 which it binds (such as its receptors) etc. Activators may be polypeptides, nucleic 
acids, carbohydrates, lipids, or derivatives thereof, or any other molecules which 
can bind to and activate the CLH product. 

"Deactivator'^ or Inhibitor • refers to a molecule which modulates the 
25 activity of the CLH product in an opposite manner to that of the activator, by 
decreasing or shortening the duration of the biological activity of the CLH 
product. This may be done by blocking the binding of the CLH to its receptor 
(competitive or non-competitive inhibition), by causing rapid degradation of the 
CLH, etc. by inhibiting association of the CLH with the effectors which regulate 
30 the expression of BMPs, etc. Deactivators may be polypeptides, nucleic acids. 



car^oir- crates. lipids, or deri\'ati\'es ihereor or an}' other moiecuie? bmc lo 

anc :::ocuiaie the ac:i\'ir}' of said product. 

''TrearinQ a disease'' - refers to adminisierins a iherareutic substance effecii\'e 
to ^nieiioraie s^mpioins associated wiih a disease, to lessen the ^e\"erir>- or cure 
the disease, or to ore\ eni the disease from occurrine. 

"Detection" - refers to a method of uetection of a cisease. This .erm may refer to 
detection of a predisposition to a disease, 

''Probe*' - the CLH nucleic acid sequence, or a sequence cumplementar}- 
therewith, when used to detect presence of other similar sequences in a sample. 
The detection is carried out b}' identification of h>bridization complexes between 
the probe and the assayed sequence. The probe may be attached to a solid support 
or to a detectable label. 

SUMMARY OF THE INVENTION 

The present invention is based on the surprising finding that there e>dst in 
humans a novel homolog of the chordin protein having a significant homology to 
the chordin protein, the homolog is a part of a longer sequence termed ''full 
sequence The invention is further based on the surprising finding that there exist 
three splice variants to the full sequence which variants are naturally occurring 
sequences produced from the novel homolog through alternative splicing. Both the 
homolog and the three variants of the full sequence are collectively termed as 

The novel CLH (in SEQ ID NO: 1) is a homolog to the known chordins 
w^ithin the VWFC domain, named after the von-Willebrand factor (VWF) type C 
repeat, which is found 2-4 times in these multi-domain proteins. The VWF 
domain has a length of about 70 amino acids covering 10 well conserved cysteines. 
The presence of this region in complex-forming proteins leads to the assumption 



that the VWTC domain might be involved in fomiing larger protein complexes. 
The three variants to the full sequence (for which the homolog is a ponion) (SEQ 
ID NO: 2-4) ha\"e 3 or 4 VW'T t>pe repeats. SEQ ID NO: 2 also has a sequence 
coding for a signal sequence, while SEQ ID NO: 3 and 4 are predicted not to have 
such a signal sequence. 

Thus the present invention provides b>* its first aspect, a no\ el isolated 
nucleic acid molecule comprising or consisting of the sequence of any one of SEQ 
ID NO: 1 to SEQ ID NO: 4, fragments of said sequence having at least 20 nucleic 
acids, or a molecule comprising a sequence having at least 70%. preferably 80%. 
and most preferably 90% identity- to any one of SEQ ID N0:1 to SEQ ID NO: 4. 

The present invention further provides a protein or polypeptide comprising 
or consisting of an amino acid sequence encoded by any of the above nucleic acid 
sequences, termed herein "CLH product'', for example, an amino acid sequence 
having the sequence as depicted in any one of SEQ ID NO: 5 to 8, fragments of the 
above amino acid sequence having a length of at least 10 amino acids, as well as 
homologs of the amino acid sequences of any one of SEQ ID NO: 5 to 8 in which 
one or more of the amino acid residues has been substimied (by conserv ative or 
non-conservative substitution) added, deleted, or chemically modified. 

The present invention further provides nucleic acid molecule comprising or 
consisting of a sequence which encodes the above amino acid sequences, 
(including the fragments and analogs of the amino acid sequences). Due to the 
degenerative nature of the genetic code, a plurality of alternative nucleic acid 
sequences, beyond SEQ ID NO:l to SEQ ID NO: 4, can code for the amino acid 
sequence of the invention. Those alternative nucleic acid sequences which code for 
the same amino acid sequences codes by the sequences of SEQ ID NO: 1 to SEQ 
ID NO: 4 are also an aspect of the of the present invention. 

The present invention further provides expression vectors and cloning 
vectors comprising any of the above nucleic acid sequences, as well as host cells 
trans fecfed by said vectors. 



The preseni in\-ention siiW tlinher proxices phannaceutica. ccmpositions 
^on:rn5ine- as an ac:i\e ingredient, said nucieic acid moleeuies. saic expression 
\"ee:ors- or said protein or poi>'peptide. 

These phar:::aceutieal compositions are suitable for the treatment of diseases 
anc nathoiogica! conditions, wnich can be ameliorated, cured or prexented b>- 
raising the ie\'el of the CLH product. Typicaii}- these are diseases which are 
manifested by non-nonaiai Ie\-els of \-arious BMP proteins (which can be highei or 
lower Than normal le% els) which can lead to pathological conditions associated with 
osteoblasts (such as fibrodx-splasia ossificans progressix'a) or of other tissues of 
miesench\-mal origin. The compositions are intended to restore the BMP levels to 
nonr:al le\-els or to restore the etTect of existing le\-e!s of BMPs to normal or near 
nonrial effects. 

Bv a second aspect, the present invention provides a nucleic acid molecule 
comprising or consisting of a non-coding sequence which is complementar\' to that 
of an>- one of SEQ ID NO: 1 to SEQ ID NO: 4. or complementar}' to a sequence 
ha\ ing at least 70% identir\' to said sequence or a fragment of said two sequences. 
The complementar>- sequence may be a DNA sequence which hybndizes with any 
one of the sequences of SEQ ID NO: 1 to SEQ ID NO: 4. or hybridizes to a portion 
of that sequence having a length sufficient to inhibit the transcription of the 
complementary sequence. The complementary sequence may be a DNA sequence 
which can be transcribed into an niRNA being an antisense to the mRNA 
transcribed from any one of SEQ ID NO: 1 to SEQ ID NO: 4 or into an mRNA 
which is an antisense to a fragment of the mRNA transcribed from any one of SEQ 
ID NO: I to SEQ ID NO: 4 which has a length sufficient to hybridize with the 
mRNA transcribed from any one of SEQ ID NO: 1 to SEQ ID NO: 4. so as to 
inhibit its translation. The complementary sequence may also be the mRNA or the 
fragment of the mRNA itself 

The nucleic acids of the second aspect of the invention may be used for 
therapeutic or diagnostic applications for example for detection of the expression of 
CLH in various tissues such as bones and other tissue of mesenchymal origin. In 



- 11 - 

addition, the ratio between the level of the chordin like homolog of SEQ ID NO: 1 
and that of any of the three splice variants of the full sequence from which the 
homolog of SEQ ID NO: 1 is a pan (SEQ ID NO: 2 to 4), or the ratio of each of the 
splice \ ariants to another splice variant may also be indicative of a pluralir\^ of 
5 physiological or pathological conditions. 

The present invention also provides expression vectors comprising any one 
of the above defined complementarv* nucieic acid sequences and host cells 
transfected with said nucleic acid sequences or vectors, being complementarv* to 
those specified in the first aspect of the invention. 

10 The invention also provides anti-CLH product antibodies, namely antibodies 

directed against the CLH product which specifically bind to said CLK product. 
Said antibodies are_ useful both for diagnostic and therapeutic purposes. For 
example said antibody may be as an active ingredient in a pharmaceutical 
composition as will be explained below. 

15 The present invention also provides pharmaceutical compositions 

comprising, as an active ingredient, the nucleic acid molecules which comprise or 
consist of said complememar\^ sequences, or of a vector comprising said 
complementary sequences. The pharmaceutical composition thus provides 
pharmaceutical compositions comprising, as an active ingredient, said anti-CLH 

20 product antibodies. 

The pharmaceutical compositions comprising said anti-CLH product 
antibodies or the nucleic acid molecule com.prising said complementary sequence, 
are suitable for the treatment of diseases and pathological conditions where a 
therapeutically beneficial effect may be achieved by neutralizing the CLH or 

25 decreasing the amount of the CLH product or blocking its binding to its target (for 
example its receptor), for example, by the neutralizing effect of the antibodies, or 
by the decrease of the effect of the antisense mRNA in decreasing expression level 
of the CLH product. Mostly these diseases are manifested by non-normal levels of 
BN^IPs in the diseased persons, or by non-normal effect of BMPs (even at regular 

30 levels) on their targeting cells. Said non-norrnafeffect is usually Higher than normal 



!e^■e:s- ma\' be down-regulaied :c produce normal ie\'eis. b>' utilizing the 
pharn:aceuLicai compositions of the in\ ention. 

According to the third aspect or the inA'ention the present in'. ention pro\udes 
methods tor detectmg the le\'el of the transcript (inRNA} of said CLH product in a 
boc}' tRud sample, or :n a speciilc tissue sample inotabh' bone tissue ), for example 
b^■ use of probes comiprising or consisting of said coding sequences: as well as 
micthods for detectm.g !e\'eli of expression of said product in tissue, e.g. cy the use 
of antibodies capable of specifical!}- reacting with the abo\ e amino acid sequences. 
Detection of the le\ el of the expression of the CLH of the irA emion may be 
ind:caii\'e of a pluralirv" of ph\'siological or pathological conditions. 

The method, according to this laner aspect, for detection of a nucleic acid 
sequence wnich encodes the CLH product in a biological sample, comprises tne 
steps of: 

I a) prox'iding a probe comprising at least one of the nucleic acid 
seauence defined above: 

(b) contacting the biological sample with said probe under conditions 
allowine hvbridization of nucleic acid sequences thereby enabling formation of 
hybridization complexes; 

fc) detecting hybridization complexes, wherein the presence of the 
complex indicates the presence of nucleic acid sequence encoding the CLH product 
in ihe biological sample. 

The amount of hybridization complexes may be determined and calibrated 
b\' comnarins it to a calibration scale in order to determine the amount of the 
nucleic acid sequence w^hich enables the CLH product in the sample. The level of 
each of the variants (SEQ ID NO: 2-4) may be detected and either compared to the 
level of the homolog (SEQ ID NO: 1) or to the level of other variants, and said 
ratio may also be indicative to a plurality of pathological or physiological 
conditions. 

By a preferred embodiment the probe is part of a nucleic acid chip used for 
detection purposes, i.e. the probe is a part of an array of probes each present in a 
known location on a solid support. 



The nucleic acid sequence used in the above method may be a DNA 
sequence an RNA sequence, etc: it may be a coding or a sequence or a sequence 
complementan- thereto (for respective detection of RNA transcripts or 
coding-DNA sequences). By quantization of the le\ el of hybridization complexes 
and calibrating the quantified results it is possible also to detect the level of the 
transcript in the sample. 

Methods for detecting mutations in the region coding for the CLH product 
are also provided, which may be methods carried-out in a binar}* fashion, namely 
merelv detecting whether there is any mismatches between the normal CLH nucleic 
acid sequence and the one present in the sample, or carried-out by specifically 
detecting the nature and location of the mutation. 

The present invention also concerns a method for detecting CLH product 
both for determining its presence, as well as its level or aherations in its level in a 
biological sample, comprising the steps of: 

(a) contacting with said biological sample the antibody of the invention, 
therebv formins an antibodv-antisen complex; and 

(b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
presence of CLH product in said biological sample. 

The present invention also concerns a method for detecting anti-CLH 
antibodies in a biological sample comprising the steps of: 

(a) contacting said biological sample with the product of the invention 
thereby forming an antibody-antigen complex; and 

(b) detecting said antibody- antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
presence of anti-CHL antibody in said biological sample. 

In many cases, diseases are detected not by detecting the presence of the 
protein (product) which caused the disease, but rather by detecting the presence in a 
biological sample (such as blood or serum) of antibodies against such a product. 
The method of detecting the presence of anti-CLH antibodies is intended to be used 



1 ne ai^ioun: or :he aniiroc} -ani:gen complex can t?e quani:i:zec. m orcer lo 
ccteHwine :he :e\ e: of the CKL-rroduct or the anti-CHL antibodies, as the case 

As explained abo\"e the le\"el of the product of SEQ ID NO: 5 i of the no\"ei 
homolog* can be compared to the ie\'e! of an\' one of the products of the \"anants 
(SEC EE; ^^OS. 6-8 f. or the le\-el of the xariant products ma>' be com.pared to each 
othen and the ^*atio berA\"een the levels ma\" be indicatr/e to a nluralir\' of 
ph>"siological and pathological conditions. In addition, the indicati\'e ratio may not 
be the ratio of the proteins themseK'es but rather the ratio of antibodies against the 
proteins. 

B}" >"et another aspect the mi\'ention also pro\'ides a method for identif>'ing 
candidate compounds capable of modulating the activitx' of CLH product (being 
either acti\'ators or deactivators). The method includes: 

(i) providing a protein or polypeptide comprising an amino acid 
sequence substantially as depicted in any one of SEQ ID NO: 5 to SEQ ID NO: 8. 
or a fragment of such a sequence; 

(Vi) contacting a candidate compound with said amino acid sequence: 

(iii) comprising the physiological effect of the amino acid sequence in the 
presence and absence of said candidate compound and selecting those compounds 
which show a significant effect on said physiological activity. 

The activit\^ of the amino acid which should be chansed bv the modulator 
(being either the activator or deactivator) may be for example the binding of the 
CEH product to its receptor, the effect of CLH on BMPs expression or activity. 
Any modulator which changes such an activity has an infecting potential, as 
serv ing as an actuator or deactivator. 

The present invention also concerns compounds identified by the above 

methods described above, wWch compound may either be an acti\'ator of the CLH 
product or a deactivator thereof 



BRIEF DESCRIPTION OF THE DRAWINGS 

In order to understand the invention and to see how ii may be carried out in 
practice, a preferred embodiment will now be described, by way of non-limiting 
example only, with reference to the accompan> ing dra\\ ings- in which: 

Fig. 1 is alignment of the CLH product of SEQ ID NO: 5 to known chordin 
protein, demonstrating the homolog>' regions within these proteins. The aignment 
was perfonned using best-fit of GCG: 

Fig. 2a is the alignment of the first splice variant (SEQ ID NO: 6) to the 
known chordin deposited in the Emb as gi 4808227; 

Fig. 2b is the alignment of the first splice variant (SEQ ID NO: 6) to the 
i<nov. n chordin deposited in the Emb under gi 38222 18; 

Fig. 2c is the- alignment of the first splice variant (SEQ ID NO: 6) to the 
known chordin deposited in the Emb under gi 3800772; 

Fig. 3a is the alignment of the second splice variant (SEQ ID NO: 7) with a 
known chordin deposited in the Emb under gi 4808227; 

Fig. 3b is the alignment of the second splice variant (SEQ ID NO: 7) with a 
known chordin deposited in the Emb under gi 3822218; 

Fig. 4a is the alignment of the third splice variant (SEQ ID NO: 8) with a 
known chordin deposited in the Emb under gi 4808227;. 

Fig. 4b is the alignment of the third splice variant (SEQ ID NO: 8) with a 
known chordin deposited in the Emb under gi 273 1578; 

Fig. 4c is the alignment of the third splice variant (SEQ ED NO: 8) with a 
known chordin deposited in the Emb under gi 2498235; 

Fig. 4d is the alignment of the third splice variant (SEQ ID NO: 8) with a 
known chordin deposited in the Emb under gi 3822218; 

Fig. 5 is multiple alignments of the sequences to several known chordins. 



DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT 

Ex:in:r:e I: CLH - nucleic acid sequence 

The nucieic acid sequences of ihe in\-enuon include nucleic acic 
seuuences uhich encode CLH produc: and u*agnienis and analogs thereof. The 
^ nucleic ac:d seauences ma\- aliemaLi\-elv be sequences comrlen:eniar>- to the 

— i - ml.*. 

abo\'e codins sequence, or to a region of said coding sequence. The length of the 
con:t:lementaiV seauence is sufficient to avoid the expression of the coding 
sequence. The nucleic acid sequences may be in the form of RNA or in the form 
of DNA. and include messenger RNA. synthetic RNA and DXA. cDNA. and 

- genomic DNA. The DNA ma>- be double-stranded or singie-stranded. and if 
sinsle-stranded may be the coding strand or the non-coding (anti-sense, 
com.rlementap. ) strand. The nucleic acid sequences may also both include 
dNTPs. rNTPs as well as non namrally occurring sequences. The sequence may 
also be a part of a hybrid benveen an amino acid sequence and a nucleic acid 

i> seauence. 

In a general embodiment, the nucieic acid sequence has at least 70%, 
preferably 80% or 90% sequence identity with any one of the sequences 
identified as SEQ ID NO: I to SEQ ID NO: 4. 

The nucleic acid sequences may include the coding sequence by itself. By 
20 another alternative the coding region may be in combination with additional 
coding sequences, such as those coding for fusion protein or signal peptides, in 
combination witn non-coding sequences, such as mtrons ana control elements, 
promoter and terminator elements or 5' and^or 3' untranslated regions, effective 
for expression of the coding sequence in a suitable host, andor in a vector or host 
25 environment in which the CLH nucleic acid sequence is mtroduced as a 
heterologous sequence. 

The nucleic acid sequences of the present invention may also have the 
product coding sequence fused in-frame to a marker sequence which allows for 
punfication of the CLH product. The marker sequence may be. for example, a 



hexahistidine tag to provide for purification of the mature polypeptide fused to 
the marker in the case of a bacterial host, or, the marker sequence may be a 
hemagglutinin (HA) tag when a mammalian host. e.g. COS-7 cells, is used. The 
HA tag corresponds to an epitope derived from the influenza hemagglutinin 
protein (Wilson. L. et al. Cell yrJbl (1984)). 

Also included in the scope of the invention are fragments also referred to 
herein as oligonucleotides. t\-pically having at least 20 bases, preferably 20-30 
bases corresponding to a region of the coding-sequence nucleic acid sequence. 
The fragments may be used as probes, primers, and when complementar>' also as 
antisense agents, and the Uke, according to known methods. 

As indicated above, the nucleic acid sequence may be substantially a 
depicted in any one^of SEQ ID NO: 1 to SEQ ID NO: 4 or fragments thereof or 
sequences having at least 70%, preferably 70-80%, most preferably 90% identity 
to the above sequence. Ahematively, due to the degenerative nature of the 
genetic code, the sequence may be a sequence coding the amino acid sequence of 
any one of SEQ ID NO: 5 to SEQ ID NO: 8, or fragments or analogs of said 
amino acid sequence. 

A. Preparation of nucleic acid sequences 

The nucleic acid sequences may be obtained by screening cDNA libraries 
using oligonucleotide probes which can hybridize to or PCR-amplify nucleic acid 
sequences which encode the CLH products disclosed above. cDNA libraries 
prepared from a variety of tissues are commercially available and procedures for 
screening and isolating cDNA clones are well-known to those of skill in the art. 
Such techniques are described in, for example, Sambrook et al (1989) Molecular 
Cloning: A Laboratory Manual (2nd Edition), Cold Spring Harbor Press, 
Plainview, N.Y. and Ausubel FM et al. (1989) Ctirrent Protocols in Molecular 
Biology, John Wiley & Sons, New York, N.Y. 

The nucleic acid sequences may be extended to obtain upstream and 
downstream sequences such as promoters, regulatory elements, and 5* and 3* 
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re rer:or::^ed b}" nui"eroiis me:hous kiio\\'n lo lho^e of skill in :he ^n. such as 
PCR or primer ex:en:^ion i Sambrcok o: ^7.'.. siir'-cr. or b> :he R_ACE nieihod 
us:::i. ibr examrie. ihe Marathon R_ACE kii (Clontech. Cat. = KISlC-I ). 

AlLcrnati\-e!> , the technique of "''csnrc::0':-s:ie" PCR (Gobmda c: c::. PCR 
Me:-:ocis Apviic. 2:3i8-22. (1993 1). which uses uni\ersal primers to retrie\'e 
flanKing sequence adjacent a kno^^■n locus. ma>' be emplo>ed. First, genomic 
DNA is amplified m the presence of primer to a linker sequeiice and a primer 
specific to the knoun region. The amplified sequences are subjected to a second 
round of PCR with the same linker primier and another specific pnmier internal to 
[he frst one. Products of each round of PCR are transcribed with an appropriate 
RNA polymerase and sequenced using reverse transcriptase. 

Inverse PCR can he used to amplify- or extend sequences using divergent 
prim.ers based on a known region (Triglia. T. et ai. Nucleic Adas Res. 16:81 86. 
(1988)). The prnners may be designed using OLIGO(R) 4.06 Primer Analysis 
SofcAvare (1992: National Biosciences Inc. Plymouth. Minn.), or another 
appropriate program, to be 22-30 nucleotides in length, to have a GC content of 
50° 0 or miore. and to anneal to the target sequence at temperatures about 68-72°C. 
The method uses several restriction enz\'mes to generate a suitable fragment in 
the known region of a gene. The fragment is then circularized by intramolecular 
ligation and used as a PCR template. 

Capture PCR (Lagerstrom. M. et ai, PCR Methods Applic. 1:111-19, 
(1991)) is a method for PCR amplification of DNA fragments adjacent to a 
known sequence in human and yeast artificial chromosome DNA. Capture PCR 
also requires multiple restriction enz\'me digestions and ligations to place an 
engineered double-stranded sequence into a flanking part of the DNA molecule 
before PCR. 

.Another method which may be used to retrieve flanking sequences is that 
of Parker. J.D.. et ai. Nucleic Acids Res., 19:3055-60, (1991)). Additionally, one 
can use PCR. nested pnmers and PromoterFinder'^ libraries to '\va/^ in" genomic 
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DNA i^PromoterFinder'^i; Clontech, Palo Alto, CA). This process avoids the need 
to screen libraries and is useful in finding introaexon junctions. Preferred 
libraries for screening for full length cDNAs are ones that have been size-selected 
to include larger cDNAs. Also, random primed libraries are preferred in that they 

5 will contain more sequences which contain the 5' and upstream regions of genes. 

A randomly primed librar>* may be panicularly useful if an oligo d(T) 
librar\- does not vield a full-lensch cDNA. Genomic libraries are useful for 
extension into the 5' nontranslated regulatory* region. 

The nucleic acid sequences and oligonucleotides of the im ention can also 

10 be prepared by solid-phase methods, according to known synthetic methods. 
Typically, fragments of up to about 100 bases are individually synthesized- then 
joined to form continuous sequences up to several hundred bases. 



B. Use of CLH nucleic acid sequence for the production of CLH products 
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In accordance with the present invention, nucleic acid sequences specified 
above may be used as recombinant DNA molecules that direct the expression of 
CLH products. 

As will be understood by those of skill in the art, it may be advantageous 
20 to produce CLH product-encoding nucleotide sequences possessing codons other 
than those which appear in any one of SEQ ID NO: 1 to SEQ ID NO: 4 which are 
those which naturally occur in the human genome. Codons preferred by a 
panicular prokaryotic or eukaryotiCLKost (Murray, E. et gL Nuc Acids Res., 
17:477-508, (1989)) can be selected, for example, to increase the rate of CLH 
25 product expression or to produce recombinant RNA transcripts having desirable 
properties, such as a longer half-life, than transcripts produced from naturally 
occurring sequence. 

The nucleic acid sequences of the present invention can be engineered in 
order to altefaXLH product coding sequenci^fbr~a~vmiety"afTeason^rin^^^^ 
30 but not limited to, alterations which modify the cloning, processing and/or 



exrression oi me croauc:. ror examcie. aiieranons i"a% ce mirocucea UMne 
:echniques which are well known in the an, e.g.. site-direcred n^uiagenesis. to 
;nser: new resinction sites, to alter gh'cos>"iation ranems. to change codon 
rreference. to produce splice \'ariants. etc. 

The present in\"eniion also includes recombinant constrt:cts comprising 
one or more of the sequences as broadh' described abo\'e. The constructs 
com.nnse a \-ector. such as a plasmid or \"iral \-ector. into which a nucleic acid 
seauence of the invention has been insened. in a forward or re\'erse orientation. 
In a preferred aspect of this embodiment, the construct funher com.prises 
resuiator}- sequences, including, for example, a prom^oter. operabh' linked to the 
sequence. Large numoers of suitable vectors and promoters are know n co Lho>e 
of skill in the an. "and are commercially available. Appropriate cloning and 
expression \ectors for use with prokar\-otic and eukar\otic hosts are also 
described in Sambrook. era/., (supra). 

The present invention also relates to host cells which are genetically 
engineered with vectors of the invention, and the production of the product of the 
invention by recombinant techniques. Host cells are genetically engineered (i.e.. 
transduced, transformed or transfected) with the vectors of this invention which 
ma\ be. for example, a cloning vector or an expression vector. The vector may 
be. for example, in the form of a plasmdd. a viral particle, a phage, etc. The 
eneineered host cells can be cultured in conventional nutrient media modified as 
appropriate for activating promoters, selecting transformants or amplifying the 
expression of the CLH nucleic acid sequence. The culture conditions, such as 
temperature. pH and the like, are those previously used with the host cell selected 
for expression, and will be apparent to those skilled in the art. 

The nucleic acid sequences of the present invention may be included in 
an} one of a variety of expression vectors for expressing a product. Such vectors 
include chromosomal, nonchromosomal and synthetic DNA sequences, e.g.. 
derivatives of SV40: bacterial plasmids: phage DNA: baculovirus; yeast 
plasmids: vectors derived from combinations of plasmids and phage DNA, viral 



-21 - 

DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. However, 
any other vector may be used as long as it is replicable and viable in the host. 
The appropriate DNA sequence may be insened into the x ector b\" a variety- of 
procedures. In general, the DNA sequence is insened into an appropriate 
5 restriction endonuclease site(s) b>' procedures known in the art. Such procedures 
and related sub-cloning procedures are deemed to be within the scope of those 
skilled in the an. 

"^he DNA sequence in the expression vector is operatively linked to an 
appropriate transcription control sequence (promoter) to direct mRNA synthesis. 

10 Examples of such promoters include: LTR or SV40 promoter, the E.coli lac or 
irp promoter, the phage lambda PL promoter, and other promoters known to 
control expression, of genes in prokar\^otic or eukaryotic cells or their viruses. 
The expression vector also contains a ribosome binding site for translation 
initiation, and a transcription terminator. The vector may also include 

15 appropriate sequences for amplifying expression. In addition, the expression 
vectors preferably contain one or more selectable marker genes to provide a 
phenotypic trait for selection of transformed host cells such as dihydrofolate 
reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in ExolL 

20 The vector containing the appropriate DNA sequence as described above, 

as well as an appropriate promoter or control sequence, may be employed to 
transform an appropriate host to permit the host to express the protein. Examples 
of appropriate expression hosts include: bacterial cells, such as E.coli., 
Sti'eptomyces, Salmonella typhimiiriiim: fungal cells, such as yeast; insect cells 

25 such as Drosophila and Spodoptera Sf9; animal cells such as CHO, COS, HEK 
293 or Bowes melanoma; adenoviruses; plant cells, etc. The selection of an 
appropriate host is deemed to be within the scope of those skilled in the art from 
the teachings herein. The invention is not limited by the host cells employed. 

In bacterial systems, a number of expression vectors may be selected 

30 depending upon the use intended for the CLH product. For example, when large 



^uaniiiics Oi lLm rrocaci are neeaea lor ine incuct:on oi aniicoaies. \eciors 
\vh:ch direct higii ;c\"ci expression of fiision protems that are readu}' piirmed :na\' 
be desirable. Such \"ectors inckide, but are not hinited to. uiuhirunctional E.cOi: 
ciordng and expression \"ecLors such as BUiCscripaR) ( Straiagene ). in which the 
CLH cohr^-tide codin2 sequence ma\' be liizated into the \ecror in-iranie with 
secuences for the amino-tenninai Met and the subsequent " residues of 
heia-oalactosidase so that a h\"brid protein is produced; pIX '/ectors (\'an Heeke 
& Schuster J. Bioi. Chem. 264:5503-5509. (1989j): pET vectors (Novagen. 
\k;dison Wl ): and the hke. 

In the \"east Saccharomvces cerevisiae a number of \'ecLors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase and 
PGH ma\" be used". For reviews, see Ausubel et ai (supra) and Grant et al.. 
\Meihods in Enzymoiogy ]_53:5 1 6-544. (1987)). 

In cases where plant expression vectors are used, the expression of a 
sequence encoding CLH product may be driven b}- any of a number of promoters. 
For example, \iral promoters such as the 35S and 19S promoters of CajVJT' 
(Brisson et al, Nature 310:51 1-514. (1984)) may be used alone or in combination 
with the omega leader sequence from TM\' (Takamatsu et ai. EMBO J., 
6:507-3 IL (1987)). Alternatively, plant promoters such as the small subunit of 
RUBISCO (Coruzzi et al, EMBO J. 3:1671-1680, (1984); Brcglie et al. Science 
224:838-843. (198-^)); or heat shock promoters (Winter J and Sinibaldi R.M., 
Results Probl Cell Differ,, 17:85-105, (1991)) may be used. These constructs 
can be introduced into plant cells by direct DNA transformation or 
Dathoeen-mediated transfection. For reviews of such techniques, see Hobbs S. or 
Murrv- L.E. (1992) in McGraw Hill Yearbook of Science and Technolog\', 
McGraw HilL New York. N.Y., pp 191-196; or Weissbach and Weissbach (1988) 
Methods for Plant Molecular Biology, Academic Press, New York. N.Y., pp 
421-463. 

CLH product may also be expressed in an insect system. In one such 
system. Autographa caiifomica nuclear polyhedrosis virus (AcNTV) is used as a 



\ ecior to express foreign genes in Spodoptera fi'ngiperda cells or in Trichophisia 
larv ae. The CLH product coding sequence may be cloned into a nonessential 
region of the virus, such as the poiyhedrin gene, and placed under control of the 
pol> hedrin promoter. Successful insenion of CLH coding sequence will render 
the poiyhedrin gene inactive and produce recombinant virus lacking coat protein 
coat. The recombinant viruses are then used to infect 5. friigiperdc cells or 
Triciwphisia larvae in which CLH protein is expressed (Smith et ai, J. Virol. 
46:584. (1983); Engelhard, E.K. et ai, Proc. Nat. Acad. ScL 91:3224-7. (1994)). 

In mammalian host cells, a number of viral-based expression systems may 
be utilized. In cases where an adenovirus is used as an expression vector, a CLH 
product coding sequence may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a nonessential El or E3 region of the viral genome 
will result in a viable virus capable of expressing CLH protein in infected host 
cells (Logan and Shenk, Proc. Natl. Acad. ScL 81:3655-59, (1984). In addition, 
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be 
used to increase expression in mammalian host cells. 

Specific initiation signals may also be required for efficient translation of 
a CLH protein coding sequence. These signals include the ATG initiation codon 
and adjacent sequences. In cases where CLH product coding sequence, its 
initiation codon and upstream sequences are inserted into the appropriate 
expression vector, no additional trans! ational control signals may be needed. 
However, in cases where only coding sequence, or a portion thereof, is inserted, 
exogenous transcriptional control signals including the ATG initiation codon 
must be provided. Furthermore, the initiation codon must be in the correct 
reading frame to ensure transcription of the entire insert. Exogenous 
transcriptional elements and initiation codons can be of various origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the 
inclusion of enhancers appropriate to the cell system in use (Scharf, D. et aL, 
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E^:--^io: 153:516-544. ( 198")). 

In a runner einbodiment. :he present indention relaies to host ceils 

coniainme the abo\"e-Gescnbed constructs. The host cell can be a higher 
: eukai-^otic celL such as a man^maiian ceil, or a lower eukar>-otic cell, such as a 

\east cell- or the host cell can be a prokar>-otic cell, such as a bacterial cell. 

Introduction of the construct into the host cell can be effected by calcium 

phosphate transfection. DEAE-Dextran mediated :ransfection. or electroporation 

(Da\is. L.. Dibner. M.. and Bauey. I. (1986) Basic Methods m Molecular 
Ml Bioloov). Cell-free translation s\-stems can also be emplo>ed to produce 

pohpeptides using RNAs derived from die DNA constructs of tne present 

in\ention. 

A host ceil strain mav be chosen for its abilir\' to modulate the expression 
of the inserted sequences or to process the expressed protein in the desired 

15 fashion. Such modifications of the protein include, but are not limited to. 
acetvlation. carboxylation. glycosylation. phosphorylation, lipidation and 
acylation. Post-translational processing which cleaves a "pre-pro" form of the 
protein may also be imiponant for correct insertion, folding and/or function. 
Different host cells such as CHO, HeLa. MDCK, 293. W138, etc. have specific 

:o cellular machiner\' and characteristic mechanisms for such post-translational 
activities and may he chosen to ensure the correct modification and processing of 
the introduced, foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 
expression is preferred. For example, cell lines which stably express CLH 

25 product may be transformed using expression vectors which contain viral origins 
of replication or endogenous expression elements and a selectable marker gene. 
Following the introduction of the vector, cells may be allowed to grow for 1-2 
days m an enriched media before ihey are switched to selective media. The 
purpose of the selectable marker is to confer resistance to selection, and its 

30 presence allows growth and recoverv- of cells which successfully express the 



introduced sequences. Resistant clumps of stably transformed cells can be 
proliferated using tissue culture techniques appropriate to the cell t\-pe. 

An^' number of selection svstems mav be used to recover transformed cell 
lines. These include, but are not limited to. the herpes simplex virus thymidine 

5 kinase (Wigler M.. ei oL, Cell lJ.:223-32. (1977)) and adenine 
phosphoribosyltransferase (Lowy I., ei al., Cell 22:817-23. (1980)) genes which 
can be employed in tk- or apn- cells, respectively. Also, antimetabolite, 
antibiotic or herbicide resistance can be used as the basis for selection; for 
example, dhfr which confers resistance to methotrexate (Wigler M.. et aL, Proc. 

10 Naii Acad. ScL 77:3567-70; (1980)); ?^pi, which confers resistance to the 
aminoglycosides neomycin and G-418 (Colbere-Garapin, F, et ai, J. Mol. Biol., 
150 :1-14, (1981)) and als or pat, which confer resistance to chlorsulfuron and 
phosphinotricin acet\'itransferase, respectively (Murry, supra). Additional 
selectable genes have been described, for example. trpB, which allows cells to 

15 utilize indole in place of tryptophan, or hisD, v/hich allows cells to utilize histinol 
in place of histidine (Hartman S.C. and R.C. Mulligan, Proc. Natl. Acad. Sci, 
85:8047-51, (1988)). The use of visible markers has gained popularity with such 
markers as anthocyanins, beta-glucuronidase and its substrate, GUS, and 
luciferase and its substrates, luciferin and ATP, being widely used not only to 

20 identify transformants, but also to quantify the amount of transient or stable 
protein expression attributable to a specific vector system (Rhodes, C.A. et. al, 
Methods Mol Biol. 55:121-131, (1995)). 

Host cells transformed with a nucleotide sequence encoding CLH product 
may be cultured under conditions suitable for the expression and recovery of the 

25 encoded protein from cell culture. The product produced by a recombinant cell 
may be secreted or contained intracellularly depending on the sequence and/or the 
vector used. As will be understood by those of skill in the art, expression vectors 
containing nucleic acid sequences encoding CLH product can be design ed with 
signal sequences which direct secretion of CLH product through a prokaryotic or 

30 eukaryotic cell membrane. 



CLH rrocuc: :::a^" also be exrressed as a recombinant protein ^^":th one or 
n;ore additional -o]}jc]:::dc donaan:s added lo :ac:l:tate nrote:n purincation. 
Sucn rur:rlcat:on lacduatme domains include, but are not limited to. metai 

^ ' — 

cheiatin.LL peptides such as hi5Lidine-ir\"ptophan m.odules that aliovv purification on 
nmm.obilized m.etais. protein A domains that aHo\\ purification on mimobilized 
imnaunoolobuiin. and the domain utihzed in the FLAGS extension atTinir>- 
pur:ncation 5>-stem (hiimunex Corp. Seanle. Wash.). The inclusion of a 
protease-cleaxabie poh'peptide linker sequence beiAveen the purincation domiain 
and CLH protein is usehul to facilitate purification. One such expression \-ector 
pro\'ides for expression of a fusion protein com.promising a CLH polypeptide 
fused to a pob/histidine region separated by an enterokinase clea\-age site. The 
histidine residues - facihtate purification on IMIAC (iiranobilized metai ion 
aff nit}- chromatography, as described in Porath. et aL, Protein Expression and 
Purincation, 3:263-28 L (1992)) while the enterokinase cleavage site provides a 
means for isolating CLH polypeptide from the fusion protein. pGEX vectors 
(Promega. Madison. Wis.) may also be used to express foreign polypeptides as 
fusion proteins with glutathione S-transferase (GST). In general, such fusion 
proteins are soluble and can easily be purified from lysed cells by adsorption to 
lisand-agarose beads (e.g., glutathione-agarose in the case of GST-fusions) 
followed by elution in the presence of free ligand. 

Followdng transformation of a suitable host strain and growth of the host 
strain to an appropriate cell density, the selected promoter is induced by 
appropriate means (e.g.. temperature shift or chemical mduction) and cells are 
cultured for an additional period. Cells are t\pically harvested by centrifugation, 
disrupted by physical or chemical means, and the resulting crude extract retained 
for farther purification. Microbial cells employed in expression of proteins can 
be disrupted by any convenient method, including freeze-thaw cycling, 
sonication. mechanical disruption, or use of cell lysing agents, or other methods, 
which are well know to those skilled in the art. 
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The CLH products can be recovered and purified from recombinant cell 
cultures by any of a number of methods well known in the an, including 
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation 
exchange chromatography, phosphocellulose chromatograph}'. hydrophobic 
f interaction chromatography. affinir\* chromatography. h>droxylapatite 
chromatography, and lectin chromatography. Protein refolding steps can be used, 
as necessar\', in completing configuration of the mature protein. Finally, high 
performance liquid chromatography (HPLC) can be employed for final 
purification steps. 

10 

C. Diagnostic applications utilizing nucleic acid sequences 

The nucleic- acid sequences of the present invention may be used for a 
variety' of diagnostic purposes. The nucleic acid sequences may be used to detect 
and quantitate expression of CLH in patient's cells, e.g. biopsied tissues, by 

15 detecting the presence of mRNA coding for CLH product. Alternatively, the 
assav mav be used to detect soluble CLH in the serum or blood. This assay 
t\'pically involves obtaining total mRNA from the tissue or serum and contacting 
the mRNA with a nucleic acid probe. The probe is a nucleic acid molecule of at 
least 20 nucleotides, preferably 20-30 nucleotides, capable of specifically 

20 hybridizing with a sequence included w^ithin the sequence of a nucleic acid 
molecule encoding CLH under hybridizing conditions, detecting the presence of 
mRNA hybridized to the probe, and thereby detecting the expression of CLH. 
This assay can be used to distinguish between absence, presence, and excess 
expression of CLH product and to monitor levels of CLH expression during 

25 therapeutic intervention. 

The invention also contemplates the use of the nucleic acid sequences as a 
diagnostic for diseases resulting from inherited defective CLH sequences. These 
sequences can be detected by com paring t he sequences of the defective (i.e., 
mutant) CLH coding region with that of a normal coding region. Association of 

30 the sequence coding for mutant CLH product with abnormal CLH product 



acii\"irA- n:a> be \"er:ilec:. In addition, sequences encoding niuiani CLH products 
can be insened into a suitable \'ecTor tor expression in a functional assay s>'stem 
le.e.. coiorimeiric assa\". complementation experiments in a CLH protein 
deilcient strain oi KEK293 ceils ) as \"et another micans to \'erin' or identify- 
muuaiions. Once miUtant genes ha^■e been identified, one can tlien screen 
populations of interest for carriers of the mutant gene. 

Indi\ iduals carr^'ing mutations m the nucleic acid sequence of the present 
in\ ention may be detected at the DNA le\-el by a variet}* of Lechniques. Nucleic 
acids used tor diagnosis ma>' be obtamed from a patient's cells, including but not 
limited to such as from blood, urine, saliva, placenta, tissue biopsy and autopsy 
miaterial. Genomic DNA may be used directly for detection or may be ampiilled 
enz^matically by using PGR (Saiki. et al„ Sature 324 :163-166. (1986)) prior to 
anah sis. RNA or cDNA ma\ also be used for the same purpose. As an example. 
PGR primers complementar>- to the nucleic acid of the present invention can be 
used to identifv- and analvze mutations in the 2ene of the present invention. 
Deletions and insertions can be detected by a change in size of the amplified 
product in comparison to the normal genotype. 

Point mutations can be identified by hybridizing amplified DNA to 
radiolabeled RNA of the invention or ahematively, radiolabeled antisense DNA 
sequences of the invention. Sequence changes at specific locations may also be 
re\ealed by nuclease protection assays, such RNase and SI protection or the 
chemical cleavage method (e.g. Cotton, et alProc. Natl. Acad Sci. USA, 
85:4397-4401, (1985)), or by differences in melting temperatures. "Molecular 
beacons'' (Kostrikis L.G. et al. Science 279:1228-1229, (1998)), hairpin-shaped, 
sin2le-stranded synthetic oligo- nucleotides containing probe sequences which 
are complementary to the nucleic acid of the present invention, may also be used 
to detect point mutations or other sequence changes as well as monitor expression 
levels of CLH product. Such diagnostics would be particularly useful for 
prenatal testing. 



.■\nother method for detecting mutations uses two DNA probes which are 
designed to hybridize to adjacent regions of a target, with abuning bases, where 
the region of known or suspected mutation(s) is at or near the abuning bases. 
The two probes may be joined at the abuning bases, e.g.. in the presence of a 
Hsase enzy me, but only if both probes are correctly base paired in the region of 
probe junction. The presence or absence of mutations is then detectable by the 
presence or absence of ligated probe. 

Also suitable for detecting mutations in the CLH product coding sequence 
are oligonucleotide array methods based on sequencing by hybridization (SBH), 
as described, for example, in U.S. Patent No, 5.547,839. In a lypical method, the 
DNA tareet analvte is hvbridized with an arrav of olieonucleotides formed on a 
microchip. The sequence of the target can then be "read'* from the pattern of 
target binding to the array. 

D. Gene mapping utilizing nucleic acid sequences 

The nucleic acid sequences of the present invention are also valuable for 
chromosome identification. The sequence is specifically targeted to and can 
hvbridize with a particular location on an individual human chromosome. 
Moreover, there is a current need for identifying particular sites on the 
chromosome. Few chromosome marking reagents based on actual sequence data 
(repeat polymorphisms) are presently available for marking chromosomal 
location. The mapping of DNAs to chromosomes according to the present 
invention is an important first step in correlating those sequences with genes 
associated with disease. 

Briefly, sequences can be mapped to chromosomes by preparing PGR 
primers (preferably 20-30 bp) from the CLH cDNA. Computer analysis of the 3' 
untranslated region is used to rapidly select primers that do not span more than 
one exon in the genomic DNA, which would complicate the amplification 
process. These primers are then used for PGR screening of somatic cell hybrids 



coniaining :ndi\"iduai human criromoson^es. Only those h>"brids containing the 
hinnan gene corresponding to the primer w ill >"ield an amplified fragiTient. 

PCR m.apping of somatic cell h\"br:ds or usm.g instead radiation h>'bnds 
are rapid procedures for assigning a paracular DXA to a panicalar chromosome. 
Usm^ the present in\-ention \\\ih the sam.e oiigonucieotide primers, 
-ublocalizaiion can be achie\ed with panels of fragments from specific 
chromosomes or pools of large genomic clones in an analogous manner. Other 
mapping strategies that can similarly be used to map to its chromosome include :n 
h\bridization. prescreening with labeled tlow-sorted chromosomes and 
preselection b>' hxbridization to construct chromosome specific-cDNA libraries. 

Fluorescence in sim hybriaizaiion (FISH) of a cDNA clone to a metaphase 
chromosomal spread can be used to provide a precise chromosomal location in 
one step. This technique can be used with cDNA as short as 50 or 60 bases. For a 
re\ iew of this technique, see Verma et ai. Human Chromosomes: a Manual of 
Basic Techniques. 0988) Pergamon Press, New York. 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with 
genetic map data. Such data are found, for exampie. in che OMIM database 
(Center for Medical Genetics, Johns Hopkins Universit\\ Bahimore, MD and 
National Center for Biotechnolog>^ Information, National Libran.' of Medicine, 
Bethesda. MD). The OMIM gene map presents the cytogenetic map location of 
disease genes and other expressed genes. The OMIM database provides 
information on diseases associated with the chromosomal location. Such 
associations include the results of linkage analysis mapped to this intervak and 
the correlation of translocations and other chromosomal aberrations in this area 
with the advent of various diseases associated with abnormal amounts or function 
ofBMPs. 
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E. Therapeutic applications of nucleic acid sequences 

Nucleic acid sequences of the invention may also be used fpr therapeutic 
purposes. Turning first to the second aspect of the invention (i.e. inhibition of 
expression of CLH). expression of CLH product may be modulated through 

5 antisense technology, which controls gene expression through hybridization of 
complementar>' nucleic acid sequences, i.e. antisense DNA or RNA. to the 
control. 5' or regulator}' regions of the gene encoding CLH product. For 
example, the 5' coding portion of the nucleic acid sequence sequence which 
codes for the product of the preseni invention is used to design an antisense 

10 oHsonucleotide of from about 10 to 40 base pairs in length. Oligonucleotides 
derived from the transcription CLHt site. e.g. benveen positions -10 and -MO 
from the CLHt site^ are preferred. An antisense DNA oligonucleotide is designed 
to be complementarv* to a region of the nucleic acid sequence involved in 
transcription (Lee et ai, Nucl. Acids, Res., 6:3073, (1979): Cooney et aL, Science 

15 241:456, (1988): and Dervan et ai, Science 251:1360, (1991)), thereby 
pre\ enting transcription and the production of the CLH products. An antisense 
RNA olisonucleotide hvbridizes to the mRNA in vivo and blocks translation of 
the mRNA molecule into the CLH products (Okano J. Neurochem. 56:560, 
(1991)). The antisense constructs can be delivered to cells by procedures known 

20 in the art such that the antisense RNA or DNA may be expressed in vivo. The 
antisense may be antisense mRNA or DNA sequence capable of coding such 
antisense mRNA. The antisense mRNA or the DNA. coding thereof can be 
complementary to the foil sequence of nucleic acid sequences coding to the CLH 
protein or to a fragment of such a sequence which is sufficient to inhibit 

25 production of a protein product. 

Turning now to the first aspect of the invention, i.e. expression of CLH, 
expression of CLH product may be increased by providing coding sequences for 
coding for said product under the control of suitable control elements ending its 
expression in the desired host. 



The nucieic acid sequences of the :n\'en:ion ma\" be en:rio\-ed in 
combination with a suitable phairnaceuticai carrier. Such compositions comprise 
a therapeuiicalh efiec:i\-e amiount or the compound, and a pnarm.aceuiicaily 
acceptable carrier or excipient. Such a carrier m.ciudes but is not i:miited to saline, 
buffered saline, dextrose, water. g:>-ceroL ethanol. and combinations thereof. The 
tormulation should suit the mode of administration. 

The products of the in\-ention as well as an>- acti\-ator5 and deacti\'ators 
coiTiPounds (see below) which are polypeptiaes. may also ce employed in 
accordance with the present invention b>- expression of such poh-peptides /;? vivo. 
which is often referred to as "gene therapy/' Cells from a patient may be 
engmeered with a nucleic acid sequence iDNA or RNA) encoding a pohpeptide 
ex vivo, with the engi:neered cells then being provided to a patient to be treated 
with the polypeptide. Such methods are well-kmown in the an. For example, cells 
ma\ be engineered by procedures known in the art by use of a retroviral particle 
containing RNA encoding a polypeptide of the present invention. 

Similarly, cells may be engineered in vivo for expression of a polypeptide 
in VIVO by procedures known in the art. As knowm in the art, a producer cell for 
producing a retroviral panicle containing RNA encoding the polypeptide of the 
present invention may be administered to a patient for engineering cells in vivo 
and expression of the polypeptide in vivo. These and other methods for 
administering a product of the present invention by such method should be 
apparent to those skilled in the art from the teachings of the present invention. 
For example, the expression vehicle for engineering cells may be other than a 
retrovirus, for example, an adenovirus which may be used to engineer cells in 
vivo after combination with a suitable delivery vehicle. 

Retroviruses from which the retroviral plasmid vectors mentioned above 
mav be derived include, but are not limited to, Moloney Murine Leukemiia Virus, 
spleen necrosis virus, retroviruses such as Rous Sarcoma Vims. Har\'ey Sarcoma 
Virus, avian leukosis virus, gibbon ape leukemia virus, human immunodeficiency 
virus, adenovirus. Vlyeloproliferative Sarcoma Virus, and mammar\' tumor virus. 
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The retro\'iral plasmid vector is employed to transduce packaging cell 
lines to form producer cell lines. Examples of packaging cells which may be 
transfecied include, but are not limited to, the PE50L PAS 17, psi-2, psi-AM, 
PA 12. TI9~14X\ VT-19-17^H2, psi-CRE, psi-CRIP, GP-E'86. GP-^envAmi2, 

5 and DAM cell lines as described in Miller {Human Gene Therapy. Vol. 1_. pg. 
5-14. (1990)). The vector may transduce the packaging cells through any means 
known in the art. Such means include, but are not limited to. electroporation, the 
use of liposomes, and CaP04 precipitation. In one alternative, the retroviral 
plasmid vector may be encapsulated into a liposome, or coupled to a lipid, and 

10 then administered to a host. 

The producer cell line generates infectious retroviral \ector particles 
which include the-nucleic acid sequence(s) encoding the polypeptides. Such 
retroviral vector particles then may be employed, to transduce eukaryotic cells, 
either in vitro or in vivo. The transduced eukaryotic cells will express the nucleic 

15 acid sequence(s) encoding the polypeptide. Eukaryotic cells which may be 
transduced include, but are not limited to, embryonic stem cells, embr}^onic 
carcinoma cells, as well as hematopoietic stem cells, hepatocytes, fibroblasts, 
myoblasts, keratinocytes, endothelial cells, and bronchial epithelial cells. 

The genes introduced into cells may be placed under the control of 

20 inducible promoters, such as the radiation-inducible Egr-1 promoter, (Maceri, 
H.J.. et al, Cancer Res., 56(19):4311 (1996)), to stimulate CLH production or 
antisense inhibition in response to radiation, eg., radiation therapy for treating 
ramors. 

25 Example 11. CLH product 

The substantially purified CLH product of the invention has been defined 
above as the product coded from the nucleic acid sequence of the invention. 
Preferably t he amin o acid sequence is an amino acid sequence having at least 
70%, preferably at least 80% or 90% identit>^ to the sequence identified as any 
30 one of SEQ ID NO: 5 to SEQ ID NO: 8. The protein or polypeptide may be in 
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::^ar.:re ana or mocinea :onr.. a:so a> cerinec aco\'e. Also con:e:::platec are 
protein traaments ha\'ing ai leas: 10 contiguous ammo acid residues. prererabK' at 
leas: :'?-20 residues. deri\'ed from the CLH produci. 

The seauence \-anaLions are rrererabh' those that are considered conscp/ed 

- substitutions- as denned abo\'e. Thus, tor examr^ie. a protein with a sequence 
ha\ ing at least 80^ o sequence identity' with the protein identified as an>" one of 
SEQ ID NO: 5 to SEQ ID NO: 8. preferably by utihzing con5er\-ed substitutions 
as detlned above is also pan of the in\'ention. In a more specific embodiment, the 
protein has or contains the sequence identified as an\' one of SEQ ID NO: 5 to 

10 SEQ ID NO: 8. The CLH product ma>" be (i) one in which one or more of the 
amiino acid residues in a sequence listed abo\'e arc substituted 'A'ith a consen'ed or 
non-consened amino-acid residue (preferabK* a conser\*ed amino acid residue), 
or (ii ) one in which one or more of the amino acid residues includes a substituent 
group, or (iii) one in which the CLH product is fused with another compound, 

if such as a compound to increase the half-life of the protein (for example. 
poi> eth\iene glycol (PEG)), or a moiet\' which ser\'es as targeting means to direct 
the protein to its target tissue or target cell population (such as an antibody), or 
(i\-^ one in which additional amino acids are fused to the CLH product. Such 
fragments, variants and derivatives are deemed to be w^ithin the scope of those 

:o skilled in the art from the teachings herein. 

A. Preparation of CLH product 

Recombinant methods for producing and isolating the CLH product and 
fragments of the protein are described above. 

25 In addition to recombinant production, fragments and ponions of CLH 

product may be produced by direct peptide synthesis using solid-phase techniques 
(cf. Stewart er aL, (1969) Solid-Phase Peptide Synthesis, WH Freeman Co, San 
Francisco: Merrifield J.. J. Am. Chem. Soc, 85:2149-2154. (1963)). In vitro 
peptide synthesis may be performed using manual techniques or by automation. 

30 Automated synthesis may be achieved, for example, using Applied Biosystems 



451 A Peptide Synthesizer fPerkin Elmer, Foster Cit}\ Calif.) in accordance with 
the instructions provided b>' the manufacturer. Fragments of CLH product may be 
chemically synthesized separately and combined using chemical methods to 
produce the full length molecule. 

B. Therapeutic uses and compositions utilizing the CLH product 

The CLH oroduct of the invention is senerallv useful in treatins diseases 
i. ' — ■ — 

and disorders which are characterized by a lower than normal level of CLH 
expression, and or diseases which can be cured or ameliorated by raising the level 

10 of the CLH product, even if the level is normal. 

Typically diese diseases are in CLH products or fragments and may be 
administered by any of a number of routes and methods designed to pravide a 
consistent and predictable concentration of compound at the target organ or 
tissue. The product-containing compositions may be administered alone or in 

15 combination with other agents, such as stabilizing compounds, and/or in 
combination with other pharmaceutical agents such as drugs or hormones. 

CLH product-containing compositions may be administered by a number 
of routes including, but not limited to oral, intravenous, intramuscular, 
transdermal, subcutaneous, topical, sublingual, or rectal means as well as by nasal 

20 application. CLH product-containing compositions may also be administered via 
liposomes. Such administration routes and appropriate formulations are generally 
known to those of skill in the art. 

The product can be given via intravenous or intraperitoneal injection. 
Similarly, the product may be injected to other localized regions of the body. The 

25 product may also be administered via nasal insufflation. Enteral administration is 
also possible. For such administration, the product should be formulated into an 
appropriate capsule or elixir for oral administration, or into a suppository for 
rectal administration. 

The foregoing exemplary administration modes will likely require that the 

30 product be formulated into an appropriate carrien including ointments, gels. 



:;::r:^o-iLor:es. Arrrorriate ibnrulation? are weil ivnown to persons skillec in the 

r-' 

Dosage of :he rrociie: wall \'ar>". depending upon :he poieney and 
iheraneutie index of ine panicuiar poi>'oepiide seiected. 

A therapeutic eomposidon for use m the treatment method can include the 
product in a sterile injectable solution, the poh'peptide in an oral deli\er}- \-ehicle- 
the rroduct m an aerosol suitable tor nasal acministration. or the proGuct in a 
nebulized fonri. all prepared according to weil known m.ethoJs. Such 
compositions comiprise a therapeutical]}- effecti\-e amiount of :he comipound. and a 
pharmiaceutically acceptable carrier or excipient. Such a carrier includes but is not 
limnted to saline, buffered sah.ne. dextrose water, glycerol, ethanol. and 
com.binations thereof.. The product of the invention may also be used to modulate 
endothelial differentiation and prolii^eration as well as to modulate apoptosis . 
either ex vivo or in vitro, for example, in cell cultures. 

Example IIL Screening methods for activators and deactivators (inhibitors) 

The present invention also includes an assay for identifying molecules, 
such as synthetic drugs, antibodies, peptides, or other molecules, which have a 
modulating effect on the activity of the CLH product, e.g. activators or 
deactivators of the CLH product of the present invention. Such an assay 
com^prises the steps of providing an CLH product encoded by the nucleic acid 
sequences of the present invention and determining its physiological activit\' on 
the target in the presence and absence of one or more candidate molecules to 
determine the candidate molecules. Those molecules which are modulating effect 
on the activity of the CLH product are selected as likely candidates for activators 
and deactivators. 

CLH product, its catahiic or immunogenic nragments or oligopeptides 
thereof, can be used for screening therapeutic compounds in any of a variet>' of 
drug screening techniques. The fragment employed in such a test may be free in 
solution, affixed to a solid support, borne on a cell membrane or located 



intracellularly. The fonnation of binding complexes, between CLH product and 
the asent beins tested, mav be measured. Altemati\'elv. the activator or 
deactivator may work by serv ing as agonist or antagonist, respectively, of the 
CLH receptor and their effect may be determined in connection with the receptor. 

5 Another technique for drug screening which may be used provides for 

high throughput screening of compounds having suitable binding affmit}' to the 
CLH product is described in detail by Geysen in PCT Application WO 84/03564. 
published on Sep. 13, 1984. In summar/, large numbers of different small 
peptide test compounds are synthesized on a solid substrate, such as plastic pins 

10 or some other surface. The peptide test compounds are reacted with the full CLH 
product or \\\lh fragments of CLH product and washed. Bound CLH product is 
then detected by methods well known in the art. Substantially purified CLH 
product can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to 

15 capmre the peptide and immobilize it on a solid suppon. 

Antibodies to the CLH product, as described in Example IV below, may 
also be used in screening assays according to methods well know^n in the art. For 
example, a sandwich'' assay may be performed, in which an anti-CLH antibody 
is affixed to a solid stirface such as a microtiter plate and CLH product is added. 

20 Such an assay can be used to capture compounds which bind to the CLH product. 
Alternatively, such an assay may be used to measure the ability of compounds to 
influence with the binding of CLH product to the CLH receptor [ I and then 
select those compounds which effect the binding. 

25 Example IV. Anti-CLH antibodies 
A. Synthesis 

In still another aspect of the invention, the purified CLH product is used to 
produce a nti-CLH antibodies which have diagnostic and therapeutic uses relate d 
to the activity, distribution, and expression of the CLH product in particular 
30 therapeutic applications in modulating the effect of CLH on BMP proteins. 



Aniib»^die? :o CLK procuci ma\ be generated by nierhods \^'e!l Ivjiown in 
:he an. Such anybodies may inciude. but are not limited to. poNxional. 
monoclonal, chimeric, humanized, single chain. Fab tbagmenis and fragments 
rroduced b}' an Fab expression hbrar>\ Antibodies, i.e.. those 'v\hich irihibit 
din:er lormation. are especiaih' preferred for therapeutic use. 

A fragmient CLH product for antibod\- induction does not require 
biological activity but ha\'e to feature imimunological acti\"in": howe\-er- the 
protein fragment or oligopeptide must be antigenic. Peptides used to induce 
specific antibodies m.a\" have an amino acid sequence consisting of at least fi\-e 
amino acids, preferabh' at least 10 amino acids of the sequences specified in SEQ 
ID VO: 2. Preferabl}' the>' should mimic a ponion of the amino acid sequence of 
the natural protein _and may contain the entire amino acid sequence of a small, 
naruralh* occurring molecule. Shon stretches of CLH protein amino acids may be 
fused with those of another protein such as keyhole limpet hemocyanin and 
antibod}' produced against the chimeric molecule. Procedures well known in the 
an can be used for the production of antibodies to CLH product. 

For the production of antibodies, various hosts including goats, rabbits, 
rats. mice, etc may be immunized bv injection with CLH product or any ponion. 
fragment or oligopeptide which retains immunogenic propenies. Depending on 
the host species, various adjuvants may be used to increase immunological 
response. Such adjuvants include but are not limited to Freund's, mineral gels 
such as aluminum hvdroxide, and surface active substances such as lysolecithin, 
pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, and dinitrophenol. BCG (bacilli Calmette-Guerin) and 
Corvmebacteriumi parMam are potentially useful human adjuvants. 

Monoclonal antibodies to CLH protein may be prepared using any 
technique w^hich provides for the production of antibody molecules by continuous 
cell lines in culture. These include but are not limited to the hybridoma technique 
originally described by Koehler and Milstein {Nature 256:495-497, (1975)), the 
human B-cell hybridoma technique (Kosbor et aL, Immunol. Today 4:72, (1983); 
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Coie ei ai, Proc. NarL Acad ScL 80:2026-2030. (1983)) and the EBV-hybridoma 
technique (Cole, etoL, MoL Cell Biol 62:109-120. (1984)). 

Techniques developed for the production of "chimeric anribodies'\ the 
spHcing of mouse antibody genes to human antibody genes to obtain a molecule 

f with appropriate antigen specificir>' and biological activit}* can also be used 
(Morrison er al., Proc, NatL Acad. Sci. 81:6851-6855, (1984"): Neuberger et al.. 
Nature 312:604-608. (1984); Takeda et al, Nature 314:452-454. (1985)). 
Alternatively, techniques described for the production of single chain antibodies 
(U.S. Pat. No. 4,946,778) can be adapted to produce single-chain antibodies 

10 specific for the CLH protein. 

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening recombinant immunoglobulin libraries or 
panels of highly specific binding reagents as disclosed in Orlandi et aL {Proc. 
Nad. Acad. Sci. 86:3833-3837, 1989)), and Winter G and Milstein C. (Nature 

15 342*293-299, (1991)). 

Antibody fragments which contain specific binding sites for CLH protein 
may also be generated. For example, such fragments include, but are not limited 
to, the F(ab')2 fragments which can be produced by pepsin digestion of the 
antibody molecule and the Fab fragments which can be generated by reducing the 

20 disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression libraries 
may be constructed to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity (Huse W.D. et aL, Science 256:1275-1281, 
(1989)). 

25 B. Diagnostic applications of antibodies 

A variety of protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies with established 
specificities ar^-well known in the art. Such immunoassaysJypically involy-eJttie^ 
formation of complexes between CLH product and its specific antibody and the 
30 measurement of complex formation. A two-site, monoclonal-based immunoassay 



:::cnoc:ona: aniircdies reaci:\'e :c ?.vo noninierienng eritopes on a 
^re^illc CLH procuct is prefen"ec. ru: a con:pen::\"e binding assa}" nia\' also be 
en:pio>ed. Tiiese assay's are described m Maddox D.E.. c* a^.. \J. Exp. \kd. 
158:1:11. i 1983)). 

Antibodies which specincaiK' bind CLH produci are useful for the 
diagnosis of conditions or diseases characterized by o\'er or under expression of 
ChH. Altemati\"eiy. such antibodies ma>' be used in assa>"5 to monitor patients 
being treated with CLH product, its acti\'aLors. or its deaciiwators. Diagnostic 
assa>"s for CLH protein include methods utilizing the antibod}* and a label to 
detect CLH product in human bod> fluids or extracts of cells or tissues. The 
prL ducts and antibodies of the present :n^■ention ma}' be used ^^■:th or \\':ihout 
modification. Frequently, the proteins and antibodies will be labeled by joining 
them., either coxalenth" or nonco\alenth'. with a reporter molecule. A wide 
variet\' of reporter molecules are know n in the art. 

A \'ariet\' of protocols for mieasuring CLH product, using either polyclonal 
or monoclonal antibodies specific for the respective protein are known in the art. 
Examples include enzyme-linked immunosorbent assay (ELISA), 
radioimmunoassay (RIA). and fluorescent activated cell sorting (FACS). As 
noted above, a two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on CLH product is preferred, 
but a competitive binding assay may be employed. These assays are described, 
among other places, in Maddox. ei al (supra). Such protocols provide a basis for 
diagnosing altered or abnorm.al levels of CLH product expression. Normal or 
standard values for CLH product expression are established by combining body 
or cell extracts taken from normal subjects, preferably human, with antibody to 
CLH product under conditions suitable for complex formation which are well 
known in the art. The amount of standard complex formation may be quantified 
by \'arious methods, preferably by photometric m^ethods. Then, standard values 
obtained from normal samples may be comipared with values obtained fi-om 
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samples from subjects potentially affected by disease. Deviation between 
standard and subject values establishes the presence of disease state. 

The antibody assays are useful to determine the le\*el of CLH present in a 
body fluid sample, in order to determine whether it is being o\ erexpres5ed or 
5 underexpressed in the tissue, or as an indication of how CLH le\*els are 
responding to drug treatment. 

Another alternative is to determine the presence and^'or le\*el of naturally 
occurring anti-CLH antibodies in a sample, such as blood or serum. Many times 
diseases are identified by detecting the presence or level of antibodies against a 
10 specific product. For the detection of such naturally occurring anti-CLH 
antibodies, the sample may be contacted with the product of the im ention, for 
example as depicted in any one of SEQ ID NO: 5 to SEQ ID NO: 8, or with an 
antigenic fragment thereof, and the presence or level of antibody-antigen 
complexes may be determined by methods well known in the an. 

15 

C. Therapeutic uses of antibodies 

In addition to their diagnostic use the antibodies may have a therapeutical 
utility in blocking or decreasing the activity of the CLH product in pathological 
conditions where beneficial effect can be achieved by such a decrease. 

20 The antibody employed is preferably a humanized monoclonal antibody, 

or a human Mab produced by known globulin-gene library methods. The 
antibody is administered typically as a sterile solution by IV injection, although 
other parenteral routes may be suitable. Typically, the antibody is administered 
in an amount between about 1-15 mg/kg body weight of the subject. Treatment is 

25 continued, e.g., with dosing every 1-7 days, until a therapeutic improvement is 
seen. 

Although the invention has been described with reference to specific 
methods and embodiments, it is appreciated that various modifications and 
changes may be made without departing from the invention. 



CLAIMS: 

1. An isolated nucleic acid sequence selected irom the group consisting of: 
li) liie nucieic acid sequence depicted in an>" one of SEQ ID \0: 1 to 

SEO ID NO: 4: 

(ii) nucleic acid sequences ha\'ing at least "O^o identit}' with the 
sequence of (i): and 

( iii ) fragments of ( i) or (iij of at least 20 b.p. 

2. A nucleic acid sequence according to Claim l(ii) wherein the nucleic acid 
sequences ha\e at least 80° o identir/ with the sequence of Claim l^i). 

10 3. A nucleic acid sequence according to Claim 2. wherein the nucleic acid 
sequences have at least 90° b identit}'. 

4. An isolated nucleic acid sequence complementar>" to the nucleic acid 
sequence of Claim 1. 

5. .An amino acid sequence selected from the group consisting of: 

15 (i) an amino acid sequence coded by the isolated nucleic acid sequence 

of Claim 1; 

(ii) fragments of the amino acid squence of (i) having at least 10 amino 

acids: 

(iii) analogues of the amino acid sequences of (i) or (ii) in w^hich one or 
20 more amino acids has been added, deleted, replaced or chemically modified 

without substantially altering the biological activity of the parent amino acid 
sequence. 

6. An amino acid sequence according to Claim 5, as depicted in any one of 
SEQ ID NO: 5 to SEQ ID NO: 8. 

25 7. An isolated nucleic acid sequence coding for the amino acid sequence of 
Claim 5 or 6. 

8. A purified antibody which binds specifically to the amino acid sequence of 
Claim 5 or 6. 
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9. An expression vector comprising the nucleic acid sequences of Claim 1 or 7 
and control elements for the expression of the nucleic acid sequence in a suitable 
host. 

10. Am expression vector comprising the nucleic acid sequence of Claim 4, and 
5 control elements for the expression of the nucleic acid sequence in a suitable host. 

11. A host cell transfected by the expression vector of Claim 9 or 10. 

12. A pharmaceutical composition comprising a phamiaceutically acceptable 
carrier and as an active ingredient an agent selected from the group consisting of: 

Ci) the expression vector of Claim 9; and 
10 (ii) the amino acid sequence of Claim 5 or 6. 

13. A pharmaceutical composition according to Claim 12, for treatment of 
diseases which can" be ameliorated, cured or prevented by raising the level of the 
Chordin-Like-Homolog (CLH). 

14. A pharmaceutical composition comprising a phamiaceutically acceptable 
15 carrier and as an active ingredient an agent selected from the group consisting of: 

(i) the nucleic acid sequence of Claim 4: 

(ii) the expression vector of Claim 10; and 

(iii) the purified antibody of Claim 8. 

15. A pharmaceutical composition according to Claim 14, for treatment of 
20 diseases which can be ameliorated or cured by decreasing the level of the CLH 

product. 

16. A pharmaceutical composition according to Claim 12 or 14, for regulating 
the levels of bone morphcgenic proteins (BMP). 

17. A pharmaceutical composition according to Claim 16 for regulating the 
25 levels of BMP-2 or BMP-4. 

18. A method for detecting an CLH nucleic acid sequence in a biological 
sample, comprising the steps of: 

(a) - hybridizing to nucleic- acid ^material . of said_bioiQgicai_sampie_a 

nucleic acid sequence of Claim 1 or 4; and 
30 (b) detecting said hybridization complex: 



^\"hcrein :he rresence of said h}bridization complex correlaies ^^"ilh the 
presence of an CLH nucieic acid sequence in the said biological sample. 

19. A mefnod according to Claim 18. wherein the nucieic acid miatenal of said 
bioiogicai sam.pie are mRN'A transcripts. 

20. A method according to Claim 18. w here the nucleic acid sequence is present 
in a nucleic acid chip. 

21. A method for identif>*ing candidate compounds capable of binding to the 
CLH product and modulating its activit:c the m.ethod comprising: 

(i) pro\iding a protein or polypeptide comprising an amino acid 
sequence substantially as depicted in any one of SEQ ID NO: 5 to SEQ ID NO: 8. 
or a fragment of such a sequence: 

(ii) comparing the physiological effect of the CLH product in the 
absence and presence of said candidate compound and selecting those compounds 
which show a significant effect on said physiological activity. 

22. A method according to Claim 21. wherein the compound is an activator and 
the measured effect is increase in the physiological activit\'. 

23. A method according to Claim 21, wherein the compound is an deactivator 
and the effect is decrease in the physiological activity. 

24. An activator of the amino acid sequence of Claim 5 or 6. 

25. An deactivator of the amino acid sequence of Claims 5 or 6. 

26. A method for detecting CLH-product in a biological sample, comprising the 
steps of: 

(a) contacting with said biological sample the antibody of Claim 8, 
therebv formins an antibodv-antigen complex; and 

(b) detecting said antibody-antigen complex 

w^herein the presence of said antibody-antigen complex correlates with the 
presence of CLH product in said biological sample. 

27. A method for detecting anci-CLH antibodies in a biological sample 
comprising the steps of: 



(a) contacting with said biological sample the antibody of Claim 5 or 6, 
thereby forming an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
presence of anti-CLH antibody in said biological sample. 



For the Applicants. 



PJEHSHOLD COHN AND RARTNERS 

Bv: 
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< 



< : c : > - 

<!":> Pa^enclr. Ver 

< 2 1 : > 1 
<2:i> 12S1 
<212> 

< 4 C C> 1 



actcccr qc-ctatiacc zgctcagagg gcgccca-g 



6C 



ttac zqcczccac:. ctccgccrgr ccaczccccc cagcctg"ga cggagccaca _20 



gcaatgcrg- cccaacrgtg tggaacctca caczccczcz gcacrccggg ccccaccaaa 160 
gtcctgccag cacaacggga ccatgtacca acacggagag atcntcagug cccatgagcr 240 
gttccccrcc cgcctgccca accagtgtgt cctctgcagc tgcacagagg gccagatcta 300 
ctgcggccrc acaaccrgcc ccgaaccagg crgcccagca ccccrcccgc tgccacacrc 560 
ctgctgccaa gccrgcaaac atgaggcaag tgagcaaccg gatgaagagg acagrgrgca 420 
gtcgctccar ggggtgagac atcctcagga tccatgttcc agtgatgctg ggacaaagag 480 
aggcccgggc accccagccc ccactggcct cagcgcccct ctigagcrtca tccctcgcca 540 
ctitcagaccc aagggagcag gcagcacaac tgtcaagatc gtcctgaagg agaaacaran 600 
gaaagccrgr gtgcatggcg ggaagacgta ctcccacggg gaggtgtggc acccggcctt 660 
ccgtgccttc ggcccttgcc catgcarcct atgcacctgr gaggatggcc gccaggacrg 720 
ccagcgtg-g acctgtccca cgaagtaccc ctgccctcac ccccagaaag tggctgggaa 780 
gtigctgcaag attcgcccag aggacaaayc agaccctggc cacagtgaga tcagttczac 840 
caggtgtccc aaggcaccgg gccgggtcct cgtccacaca tcggta-_ccc caagcccaga 900 
caacctgcgt cgctzzqccc tggaacacga ggcctcggac ttggrggaga tctacctctg 960 
gaagctggra aaagatgagg aaactgaggc tcagagaggt gaagtacctg gcccaaggcc 1020 
acacagccag aacttccact tgactcagat: caagaaagrc: aggaagcaag actrccagaa 1080 
agaggcacag cacrtccgac rgctcgctgg cccccacgaa ggtcacrgga acgUcc-cct 1140 
agcccagacc crggagctga aggtcacggc cagrccagac aaagrgacca agacaraaca 1200 
aagacctaac agttgcagat atgagctgta taattgttgt tattatatar taataaataa 1260 
gaagttgcat aaccatcaaa a *^ 



<210> 2 
<211> 1722 
<212> D^IA 



1 



< C 0 > 2 



a w v_ ^ w ^ u 


^ t_ L> c w g w 


cccctctccc 


W ^ W> ^ \^ NiA 


ccttccttcc 

■ml 


tctctccauc 


50 




u tccccgcgr 


"cctctntcca 


CCttTLCCCcL. 


cttcccaccu 


'cagacc^ccc 


120 


W 0 W ^ C ' 


cct'C'Ccc'cgc 


ccacccctcrc 


t ucct ggccc 


t~ ctccgacc 


ccGc~c^ ace 


ISO 




c nggggtca t 


gcgggzrgat 


W ^ CT ^ 


tgtgnct ccG 


tccccfct." c 


240 


Qu CtCCCC wC 


ctcccgactc 


cgctcccgga 


ccaccggcct 


gaccctgggg 


o c £ CT o. O 


300 


■j- «^ ^ J— r — y-»f /— r » »^ 
^ V>r ^ & \^ w ^ 


aaactcctct 

^ ^ 


cctccttact 


CGCactCGCG 


ctgctctggt 


tccccc -gga 


3 50 


ct cccacgcz 


cgagcccgcc 






catigggaaga 




420 


cqoccacarc 


t accaccc ct 


c«. ^ O O O w 


^ ^ ^ CZ- ^ C7 


a tctac -Gcc 




48? 


C"LOCtCoC5.C# 


ggcgcccatg 


■cgagfcgcta 


ccccc rccac 


cgtccgcctg 


tccactgccc 


5 40 


ccagcctgt g 


acggagccac 


agcaatgctg 


tcccaactat 


gtggaacctc 


acaccccc'cc 


600 


tqcacTiccca 


gccccaccaa 


agtcctgcca 


gcacaacggg 


acca tgtacc 


aacacggaga 


560 


ca tct::cacr 


accca taacc 

^ — ^ 


tgi: tcccc tc 




aaccagtgtg 


tcccctgcag 


720 


ccgcacagag 


ggccagatct 


actgcggcct 


cacaacctgc 


cccgaaccag 


gcrgcccagc 


730 


acccctcccg 


ctgccagact 


cc t gctgcca 


agcctgcaaa 


gatgaggcaa 


gtgagcaa tc 


340 


ggatgaagag 


gacagtgtgc 


agtcgctcca 


tggggrgaga 


ca tcctcagg 


atcca tgt r c 


900 


cagtga tgct 


gggagaaaga 


gaggcc<:ggg 


caccccagcc 


cccactggcc 


tcagcgcccc 


960 


tctgagct uc 


a tccc tcgcc 


acttcagacc 


caagggagca 


ggcagcacaa 


ctgtcaagat 


1020 


cgtcctgaag 


gagaaacata 


ngaaagcctg 


tgtgcatggc 


gggaagacgt 


actcccacgg 


1080 


ggaggtgtgg 


cacccggcct 


tccgtgcctt 


cggcccttgc 


ccatgca tec 


tatgcaccrg 


1140 


tgaggatggc 


cgccaggact 


gccagcgtgt 


gacctgtccc 


acgaagtacc 


cctgccgtca 


1200 


ccccgagaaa 


gtggctggga 


agtgctgcaa 


gatttgccca 


gaggacaaag 


cagaccctgg 


1260 


ccacagtgag 


atcagttcta 


ccaggtgtcc 


caaggcaccg 


ggccgggtcc 


tcgtccacac 


1320 


atcggtatcc 


ccaagcccag 


acaacctgcg 


tcgctttgcc 


c tggaacacg 


aggcctcgga 


1380 


cttggtggag 


atctacctct 


ggaagctggt 


aaaagatgag 


gaaactgagg 


ctcagagagg 


1440 


tgaagtacct 


ggcccaaggc 


cacacagcca 


gaatttccac 


ttgactcaga 


tcaagaaagt 


1500 


caggaagcaa 


gacttccaca 


aagaggcaca 


gcacttccga 


ctgctcgctg 


gcccccacga 


1560 


aggtcactgg 


aacgtcttcc 


ragcccagac 


cctggagctg 


aaggtcacgg 


ccagtccaga 


1620 


caaagtgacc 


aagacataac 


aaagacctaa 


cagttgcaga 


tatgagctgt 


ataattgttg 


1680 


ttattatata 


ttaataaata 


agaagttaca 


taaccatcaa 


aa 




1722 



<210> 3 
<211> 1515 
<212> DMA 
<213> Humanus 

<400> 3 

agaacagtgc ctggnactga gaagtgctnc atgaaccctc attaaaatgc tggatgaact 60 

aactcgggcc ccactgagcg gaagctgagg ctgccggctt taccttcttg ctctggttgg 120 

ggctggccca gacactgagg gggctggagg gctgtggtag aggtcatggg agggagggac 180 

tcagtcagat gtaggtatca gagggacctc ttagttagct gataagggga atggctggca 240 
aggcccaggc cagagc g-^t ttaaatatc aagctgg gg t -oa a a-tg c^^^— a4- g at c a^^^~3 0 0- 

aagtggcctt gttaatttca gcaaagattc acatgaaacc tcattttctt cttcctcctg 360 

cccctccccc actgtggaac ctcacactcc ctctggactc cgggccccac caaagtcctg 420 

ccagcacaac gggaccatgt accaacacgg agagatcttc agtgcccatg agctgttccc 480 



2 




<21Q> 4 
<211> 16 rC 
<212> DNA 
<213> H'-L- = nu£ 



<400> 4 














tagnancgr.c 


cgccaggggn 


tgnattcgcc 


ctractcann 


a tagggntna 


acggccgccc 


60 


ggncaggtigc. 


caaa t ggaca 


aataaaagga 


aacaagcatig 


attgtnaggn 


cagaggaggg 


120 


tggnatcnag 


tcagaagacr 


ggtgcrgt ca 


tccc I c:cntc 


gtgact gacr 


tgctgtgt gg 


180 


ccntcagc" g 


tiaacTir accc 


tctctgggcc 


r caT:'cr gt ct 


aatca taa ta 


attaacgr tg 


240 


a taccatga I. 


ataaa tctgt 


acagcat't.'t.c 


actgctt ga t 


tccctaactg 


ccctgtgaga 


300 


taagcgt i_c:ia 


ggctcagaga 


cagtiggcar g 


cccagtgtt g 


racagtaagt 


gtgtggtaaa 


360 


gccgagat.t.c 


aaacrcagac 


cttctggccc 


ct tgccragg 


agagcatgcc 


cagtt gtcr a 


420 


gcagac 


r cttg^ctga 


gtggcccaga 


rgacaT:cr.ct 


Lt tagaccr a 


caaagaagga 


480 


gaaatgagac 


agggtctttg 


ggctggagcc 


tcctgggact 


aacatggcac 


tggtcggrtt 


540 


gccaggccca 


gacatgttct 


gccttttcca 


tgggaagaga 


tactcccccg 


gcgagagctg 


600 


gcacccczac 


L. tggagccac 


aaggccr ga t 


g^actgcc <-g 


cgctgtaccr 


gcrcagaggg 


660 


cgcccatgzg 


agtrgttacc 


gcctccacr g 


tccgcctigr c 


cactgccccc 


aacctar gac 


720 


ggagccacag 


caatgctgtc 


ccaagtgtgr 


ggaacctcac 


actccctctg 


gactccgggc 


780 


cccaccaaag 


tcctgccagc 


acaacgggac 


catgtaccaa 


cacggagaga 


tcttcagrgc 


840 


ccat gagccg 


t tcccctccc 


gcctgcccaa 


ccagtgtgtc 


ctctgcagct 


gcacagaggg 


soo 


ccagatcrac 


t gcggccrca 


caacctgccc 


cgaaccaggc 


tgcc::agcac 


ccctcccgct. 


960 


gccagact cc 


tgctgccaag 


cctgcaaaga 


tgaggcaagt 


gagcaatcgg 


atgaagagga 


1020 


cagt gtgcag 


t cgctccatg 


gggtgagaca 


tcctcaggat 


ccatgttcca 


gtgatgctgg 


1080 


gagaaagaga 


ggcccgggca 


ccccagcccc 


cactiggcctic 


agcgcccctc 


tgagctt cat 


1140 


cccccaccac 


tt cacaccca 


agggagcagg 


cagca caact 


gtcaagatcg 


tcctgaagga 


1200 


gaaacatang 


aaagcctigtig 


tgcat ggcgg 


gaagacgtac 


tcccacgggg 


aggtgtiggca 


1260 


1^ V-' ^ ^ta' ^ 


cgt gccttcg 


r~ ^ *~ rrf— r-*— 


a tGca tccta 


t gcacctgtg 


aaaa tqcrcca 


1320 







c o ^ c c c a c* 


caacti a cccc 


^4 " _ ~ 


ccciaca aa c ^ 
— - 


138C 


» S ^ 




"titiLLcrccciaca 

W 


ccacaaaGca 


caccctLCJCCC 

fcd^ 


acaccoacaw 

^ ^ «r 


144C 




G \^ ^.ff ^ u ^ W W 


4^ \^ \^ U ^ 


V«l «v W 
^ ^ ^ 


CT ^ C O 3^ C S C o. i*- 


0* 


150C 






^ ^ O W ^ <v 


W W 


acc^cdcacti. 

Vi* 




1560 


ccac c - c crc 




•IT J J ^ 




^ ' — ' 


^ I I- 

•rt. ^P* v« V — ' W 




cccaacgcca 


-—^ ^ >^ 

CdCdgcccca 


anccccacCw 


oacccagauC 




d ^ ^ d ^ 


' 6S0 


ccccca^aaa 


cacqcacacc 


acctccgact 


gcccgctggc 


ccccacgaag 


.-T t* p r* ^ ^ Z5 

w w <^ G ^ w ^ 


1740 


cgtcttccta 


gcccagaccc 


tggagctgaa 


ggtcacggcc 


agt ccagaca 


aagt gacc aa 


1800 


^■rf a ci. w c» d w d ^ 


acacccaaca 


O ^ w O w d CT c« L- 


Siw* kw« ^ ^ 






1860 


aauaaatiaa^ 


aac'C'cgca'ca 


acca'ccaaaa 








1S5C 



<210> 5 
<211> 395 
<212> PRT 
<213> Hurr.anus 



<400> 5 

Arg Glu Pro Gin 

Gly Ala :-:is Val 

20 

Pro Gin Pro Val 
35 

Pro His Thr Pro 
50 

Asn Gly Thr Met 
65 

Phe Pro Ser Arg 



Gly Leu Met Tyr 

5 - 

Ser Cys Tyr Arg 



Thr Glu Pro Gin 

40 

Ser Gly Leu Arg 
55 

Tyr' Gin His Gly 
70 

Leu Pro Asn Gin 
85 



Cys Leu Arg Cys 
10 

Leu His Cys Pro 
25 

Gin Cys Cys Pro 



Ala Pro Pro Lys 

60 

Glu lie Phe Ser 
75 

Cys Val Leu Cys 



Thr Cys Ser Glu 
15 

Pro Val His Cys 
30 

Lys Cys Val Glu 
45 

Ser Cys Gin His 



Ala His Glu Leu 

80 

Ser Cys Thr Glu 
95 



Gly Gin lie Tyr Cys Gly Leu Thr Thr Cys Pro Glu Pro Gly Cys Pro 

100 105 110 

Ala Pro Leu Pro Leu Pro Asp Ser Cys Cys Gin Ala Cys Lys Asp Glu 
115 120 125 

Ala Ser Glu Gin Ser Asp Glu Glu Asp Ser Val Gin Ser Leu His Gly 
130 135 140 



Val Arg His Pro Gin Asp Pro Cys Ser Ser Asp Ala Gly Arg Lys Arg 
145 150 155 160 



4 



i— ^ ^ 



5 _ _ e _ c - 



4_ W W 



" C 
^ ^ ^ 



Val Ala 31 



ys Cys ^ys ^^e Cys 

Z € 5 



Pre Glu Asp Lys Ala 



'-^ " 



G H s S 3 



lie Ser Ser 



280 



Cvs Pre Lvs Ala Pro 



285 



Val Leu 
250 



"al His Thr Ser Val Ser Pre Ser Pro Asp Asn Leu Arg Arg 



G C 



300 



Phe Ala Leu Glu His Glu Ala Ser Asp Leu Val Glu lie Tyr Leu Tr--- 



305 



0 



315 



^ r-^ 

C ^L ^^ 



Lys Lsu Val Lys Asp Glu Glu Thr Glu Ala Gin Arg Gly Glu Val Pre 

325 330 335 

Gly Pro A^g Pro Kis Ser Gin Asn Phe H^s Leu Thr Gin He Lys Lys 

340 345 350 



Val Arg Lys Gin Asp -he 



Gin 



Lys Glu Ala Gin His Phe Arg Leu Leu 
360 365 



Ala Gly Pro His Glu Gly His Trp Asn Val Phe Leu Ala Gin Thr Leu 



370 



375 



380 



Glu Leu Lys Val Thr T^^a. Ser Pro Asp Lys Val Thr Lys Thr 
385 390 395 



<210> 6 



5 



<211> 539 
<212> ?RT 
<213> H'Jirr.anuH 

<4C0> 6 

Ser Pre leu Pre 5er Ala 31y Pre Ser ?r.e Val Ser Pre 5er leu Pre 
1 5 10 15 

Pro Phe Pro .-ILa Phe Ser Phe His Leu Ser Leu Leu Pre Thr Leu Asc 

20 25 30 

Leu Pro Ser Cys Pro Pro Phe Leu Pro Thr Ala Ala Ser Trc Pro Phe 
3 5 4 0 4 5 

Ser Asp Pro Ala Leu Ala Ala Asp Leu Leu Gly Ser Cys Gly Leu lie 
50 55 60 

Cys Gly Pro Cys Xaa Ser Val Ser Phe Ser Ser Pro Val Leu Pro Thr 
65 70 75 80 

Pro Leu Pro Asp Gin Arg Pro Asp Pro Gly Glu Arg Met Val Pro Glu 

85 90 S5 

Val Arg Val Leu Ser Ser Leu Leu Gly Leu Ala Leu Leu Trp Phe Pro 

100 105 110 

Leu Asp Ser His Ala Arg Ala Arg Pro Asp Met Phe Cys Leu Phe His 
115 120 125 

Gly Lys Arg Tyr Ser Pro Gly Glu Ser Trp His Pro Tyr Leu Glu Pro 
130 135 140 

Gin Gly Leu Met Tyr Cys Leu Arg Cys Thr Cys Ser Glu Gly Ala His 
145 150 155 160 

Val Ser Cys Tyr Arg Leu His Cys Pro Pro Val His Cys Pro Gin Pro 

165 170 175 

Val Thr Glu Pro Gin Gin Cys Cys Pro Lys Cys Val Glu Pro His Thr 

180 185 190 

Pro Ser Gly Leu Arg Ala Pro Pro Lys Ser Cys Gin His Asn Gly Thr 
195 200 205 

Met Tyr Gin His Gly Glu He Phe Ser Ala His Glu— Leu Phe P-xo Ser- 
210 215 220 

Arg Leu Pro Asn Gin Cys Val Leu Cys Ser Cys Thr Glu Gly Gin He 



6 



v£ ^er ^er .-^.si: -_a 



leu 5er £ rrc Leu :Der rr.e __e r-z .-.r:: 



^ o w 



His ?he Ar 



a Fro Lvs G^y Au 
325 ■ 



^'a Gly Ser Thr Thr Val Lys He Val Leu 

530 335 



3Lu _ s Hls ?laa —vs j~_i^a Cv's va_ 



40 



^ ^ ^ 



His Gly Gl 



Val Trp His Pro Ala ?he Arg Ala ?he Gly 

360 365 



Cys He Leu Cys Thr Cys Glu Asp Gly Arg Gin Asp Cys Gin Arc Val 
370 375 380 



Thr Cys 
385 



rc Thr Lys Tyr Pro Cys Arg His Pro Glu Lys Val Ala Gly 

390 3S5 400 



Lys Cys 



;ys He Cys Pro Glu Asp Lys 
-205 



^■'a Asp Pro Gly His Ser 
410 415 



Glu He Ser Ser Thr Arg Cys Pro Lys Ala Pro Gly Arg Val Leu Val 

420 425 430 

His Thr Ser Val Ser Pro Ser Pro Asp Asn Leu Arg Arg Pne Ala Leu 
435 440 445 

Glu His Glu Ala Ser Asp Leu Val Glu He Tyr Leu Trp Lys Leu Val 
450 455 460 

Lys Asp Glu Glu Thr Glu Ala Gin Arg Gly Glu Val Pro Gly Pro Arg 
465 470 475 480 



Pro His Ser Gin Asn Phe His Leu Thr Gin He Lys Lys Val Arg Lys 



7 



485 490 495 

Gin Asc ?he Gin Lvs Glu .~.ia Gin His ?he .-.rc Leu leu .^.la Gly ?rc 

500 5C5 51C 

His Glu Glv His Trp Asn Val ?he Leu Ala Gin Thr Leu Glu Leu Lys 
515 520 525 

Val Thr .-.la Ser Pre Asp Lys Val Thr Lys Thr 
530 535 



<210> 7 
<211> 388 
<212> PRT 
<213> HujT.anus 

<400> 7 

lie Ser Ser Trp Gly Gin Met Gin Asn His Gin Lys Ser Gly Leu Val 
15 10 15 

Asn Phe Ser Lys Asp Ser His Glu Thr Ser Phe Ser Ser Ser Ser Cys 

20 25 30 

Pro Ser Pro Thr Val Glu Pro His Thr Pro Ser Gly Leu Arg Ala Pro 
35 40 45 

Pro Lys Ser Cys Gin His Asn Gly Thr Met Tyr Gin His Gly Glu lie 
50 55 60 

Phe Ser Ala His Glu Leu Phe Pro Ser Arg Leu Pro Asn Gin Cys Val 
65 70 75 80 

Leu Cys Ser Cys Thr Glu Gly Gin lie Tyr Cys Gly Leu Thr Thr Cys 

85 90 95 

Pro Glu Pro Gly Cys Pro Ala Pro Leu Pro Leu Pro Asp Ser Cys Cys 

100 105 110 

Gin Ala Cys Lys Asp Glu Ala Ser Glu Gin Ser Asp Glu Glu Asp Ser 
115 120 125 

Val Gin Ser Leu His Gly Val Arg His Pro Gin Asp_ErjO_Cy^_Sjex_S£X_ 
130 135 140 

Asp Ala Gly Arg Lys Arg Gly Pro Gly Thr Pro Ala Pro Thr Gly Leu 
145 150 155 160 



8 



i 



^ ^ o 



^i.-. -^yC? r^_w 



Glu Asp lys Ala Asp Prc^ Gly Kis Ser Glu 

2 € 0 - 6 5 



le Ser Ser Thr Arg Cys 



Gly Arg Va^ 



leu '.'a_ r.is 



r Ser Val Ser ?rc Ser 

^1 c 5 



Fro Asc A-n leu Arg Arg ?he Ala leu Glu His Glu Ala Ser Asp Leu 
290 295 300 



Val Glu :ie Tyr Leu Trp Lys Leu /al ^ys ^sp 
305 310 315 



Val TVS Asc Glu Glu Thr Glu Ala 



Gin Arg Gly Glu Val Pro Gly Pro Arg Pro Kis Ser Gin Asn Phe His 

325 330 335 



Leu Thr Gin lie Lys Lys Val Arg Lys Gin Asp Phe Gin Lys 



340 



345 



u Ala 



350 



Gin His Phe Arg Leu Leu Ala Gly Pro His Glu Gly His Trp Asn Val 
3-5 360 365 

Phe Leu --ILa Gin Thr Leu Glu Leu Lys Val Thr Ala Ser Pro Asp Lys 
370 375 380 



Val Thr Lys Thr 
385 



9 



<210> ? 
<211> 43? 
<212> PRT 
< 2 1 5 > Hurr.a nu s 



<40G> 8 

AsTD Ara Val ?he Glv Leu Glu Pro Pro Giy Thr 
1 5 10 



Asn Met Ala Leu 



15 



Glv Leu 



Pro Gly 
20 



w ?ne 



25 



is Gly Lys 
30 



Ser Pro Glv Glu Ser Trp His Pro Tyr Leu Glu Pro Gin Gly Leu Me ^ 
35 40 45 



Tyr Cys leu Arg Cys Thr Cys Ser Glu Gly Ala His Val Ser Cys Tyr 

50 55 60 

Arg Leu His Cys Pro Pro Val His Cys Pro Gin Pro Val Thr Glu Pre 

65 70- - 75 80 



Gin Gin Cys Cys Pro Lys Cys Val Gl\i Pro His Thr Pro Ser Gly Leu 

85 90 55 



Arg Ala Pro Pro Lys Ser Cys Gin His Asn Gly Thr Met Tyr Gin His 

100 105 110 

Gly Glu lie Phe Ser Ala His Glu Leu Phe Pro Ser Arg Leu Pro Asn 
115 120 125 

Gin Cys Val Leu Cys' Ser Cys Thr Glu Gly Gin lie Tyr Cys Gly Leu 
130 135 140 

Thr Thr Cys Pro Glu Pro Gly Cys Pro Ala Pro Leu Pro Leu Pro Asp 
145 150 135 160 

Ser Cys Cys Gin Ala Cys Lys Asp Glu Ala Ser Glu Gin Sar Asp Glu 

165 170 175 

Glu Asp 5er val Gin Ser Leu His Gly Val Arg His Pro Gin Asp Pro 

180 185 190 

Cys Ser Ser Asp Ala Gly Arg Lys Arg Gly Pro Gly Thr Pro Ala Pro 
195 200 205 



Thr Gly Leu Ser Ala Pro Leu Ser Phe lie Pro Arg His Phe Arg Pro 
210 215 220 



10 





^ 1. 



? ^ c j-v s "c s n ^ s ''^ IT 
3 4C 



g Arc ?he Ala Leu 

-.4 5 



n 5 u 

350 



Ser 



.£ Ti; _ e U V a ^ w ^ - - 



Tyr Leu 



Trc Lvs Leu Val Lvs Asp Glu 



36 



^ '-J 



'hr Glu ;^^a Gin Arg Gly Glu Val Pro Gly Pre Arg Pro His Ser Gin 
3-0 375 380 



Asn ?he His Leu 
365 

Lvs Glu A^a Gin 



Thr Gin lie Lys 
390 

His ?he Ajrg Leu 
4C5 



Lys Val Arg Lys 

'I G 5 

Leu Ala Gly Pro 
410 



Gin Asp ?he Gin 

400 

His Glu Gly His 
415 



T As n 



Val Phe Leu Ala Gin Thr Leu Glu Leu Lys Val Thr 



420 



425 



430 



?-_La 



Ser 



Pro Asp Lys Val Thr Lys Thr 
435 



11 
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